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Abstract 

This Chapter expounds the modern theory of symplectic reduction in finite- 
dimensional Hamiltonian mechanics. This theory generalizes the well-known con- 
nection between continuous symmetries and conserved quantities, i.e. Noether's 
theorem. It also illustrates one of mechanics' grand themes: exploiting a sym- 
metry so as to reduce the number of variables needed to treat a problem. The 
exposition emphasises how the theory provides insights about the rotation group 
and the rigid body. The theory's device of quotienting a state space also casts 
light on philosophical issues about whether two apparently distinct but utterly 
indiscernible possibilities should be ruled to be one and the same. These issues 
are illustrated using "relationist" mechanics. 

Mottoes 

The current vitality of mechanics, including the investigation of fundamental questions, 
is quite remarkable, given its long history and development. This vitality comes about 
through rich interactions with pure mathematics (from topology and geometry to group 
representation theory), and through new and exciting applications to areas like control 
theory. It is perhaps even more remarkable that absolutely fundamental points, such 
as a clear and unambiguous linking of Lie's work on the Lie-Poisson bracket on the 
dual of a Lie algebra ... with the most basic of examples in mechanics, such as the 
rigid body and the motion of ideal fluids, took nearly a century to complete. 
Marsden and Ratiu (1999, pp. 431-432). 

In the ordinary theory of the rigid body, six different three-dimensional spaces IR^, IR^*, q, 
g*,TGg,T*Gg are identified. 
Arnold (1989, p. 324). 
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1 Introduction 



1.1 Why classical mechanics? 

All hail the rise of modern physics! Between 1890 and 1930, the quantum and rela- 
tivity revolutions and the consolidation of statistical physics through the discovery of 
atoms, utterly transformed our understanding of nature; and had an enormous influ- 
ence on philosophy; (e.g. Kragh 1999; Ryckman 2005). Accordingly, this Handbook 
concentrates on those three pillars of modern physics — quantum theories, spacetime 
theories and thermal physics. So some initial explanation of the inclusion of a Chapter 
on classical mechanics, indeed the classical mechanics of finite-dimensional systems, is 
in order. 

The first point to make is that the various fields of classical physics, such as mechan- 
ics and optics, are wonderfully rich and deep, not only in their technicalities, but also 
in their implications for the philosophy and foundations of physics. Prom Newton's 
time onwards, classical mechanics and optics have engendered an enormous amount 
of philosophical reflection. As regards mechanics, the central philosophical topics are 
usually taken (and have traditionally been taken) to be space, time, determinism and 
the action-at-a-distance nature of Newtonian gravity. Despite their importance, I will 
not discuss these topics; but some other Chapters will do so (at least in part, and 
sometimes in connection with theories other than classical mechanics). I will instead 
focus on the theory of symplectic reduction, which develops the well-known connection 
between continuous symmetries and conserved quantities, summed up in Noether's 
"first theorem". I choose this focus partly by way of preparation for parts of some 
other Chapters; and partly because, as we will see in a moment, symplectic reduction 
plays a central role in the current renaissance of classical mechanics, and in its relation 
to quantum physics. 

I said that classical physics engendered a lot of philosophical reflection. It is worth 
stressing two, mutually related, reasons for this: reasons which today's philosophical 
emphasis on the quantum and relativity revolutions tends to make us forget. 

First: in the two centuries following Newton, these fields of classical physics were 
transformed out of all recognition, so that the framework for philosophical reflection 
about them also changed. Think of how in the nineteenth century, classical mechanics 

and optics gave rise to classical fleld theories, especially electromagnetism. And within 
this Chapter's spcciflc fleld, the classical mechanics of flnite-dimcnsional systems, think 
of how even its central theoretical principles were successively recast, in fundamental 
ways, by flgures such Euler, Lagrange, Hamilton and Jacobi. 

Second, various difficult problems beset the attempt to rigorously formulate classi- 
cal mechanics and optics; some of which have considerable philosophical aspects. It is 

not true that once we set aside the familiar war-horse topics — space, time, determinism 
and action-at-a-distance — the world-picture of classical mechanics is straightforward: 
just "matter in motion". On the contrary. Even if we consider only finite- dimensional 
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systems, we can ask, for example: 

(i) For point-particles (material points): can they have different masses, and if 
so how? What happens when they collide? Indeed, for point-particles interacting only 
by Newtonian gravity, a collision involves infinite kinetic energy. 

(ii) For extended bodies treated as finite-dimensional because rigid: what hap- 
pens when they collide? Rigidity implies that forces, and displacements, are transmit- 
ted "infinitely fast" through the body. Surely that should not be taken hterally? But 
if so, what justifies this idealization; and what are its scope and limits? 

As to infinite-dimensional systems (clastic solids, fiuids and fields), many parts of 
their theories remain active research areas, especially as regards rigorous formulations 
and results. For contemporary work on elastic sohds, for example, cf. Marsden and 
Hughes (1982). As to fluids, the existence and uniqueness of rigorous solutions of the 
main governing equations, the Navier-Stokes equations, is still an open problem. This 
problem not only has an obvious bearing on determinism; it is regarded as scientifi- 
cally significant enough that its solution would secure a million-dollar Clay Millennium 
prize. 

These two reasons — the successive reformulations of classical mechanics, and its 
philosophical problems — are of course related. The monumental figures of classical 
mechanics recognized and debated the problems, and much of their technical work 
was aimed at solving them. As a result, there was a rich debate about the founda- 
tions of classical physics, in particular mechanics, for the two centuries after Newton's 
Principia (1687). A well-known example is Duhem's instrumentalist philosophy of 
science, which arose in large measure from his realization how hard it was to secure 
rigorous foundations at the microscopic level for classical mechanics. A similar exam- 
ple is Hubert's being prompted by his contemporaries' continuing controversies about 
the foundations of mechanics, to choose as the sixth of his famous list of outstand- 
ing mathematical problems, the axiomatization of mechanics and probability; (but for 
some history of this list, cf. Grattan-Guinness (2000)). A third example, spanning 
both centuries, concerns variational principles: the various principles of least action 
formulated first by Maupertuis, then by Euler and later figures — first for finite classi- 
cal mechanical systems, then for infinite ones — prompted much discussion of teleology. 
Indeed, this discussion ensnared the logical empiricists (Stoltzner 2003); it also bears 
on contemporary philosophy of modality (Butterfield 2004). 

In the first half of the twentieth century, the quantum and relativity revolutions 
tended to distract physicists, and thereby philosophers, from these and similar prob- 
lems. The excitement of developing the new theories, and of debating their implications 
for natural philosophy, made it understandable, even inevitable, that the foundational 
problems of classical mechanics were ignored. 

Besides, this tendency was strengthened by the demands of pedagogy: the neces- 
sity of including the new theories in physics undergraduate degrees. By mid-century, 
the constraints of time on the physics curriculum had led many physics undergradu- 
ates' education in classical mechanics to finish with the elementary parts of analytical 



mechanics, especially of finite-dimensional systems: for example, with the material 
in Goldstein's well-known textbook (1950). Such a restriction is understandable, not 
least because: (i) the elementary theory of Lagrange's and Hamilton's equations re- 
quires knowledge of ordinary differential equations, and (ii) elementary Hamiltonian 
mechanics forms a springboard to learning elementary canonical quantization (as does 
Hamilton- Jacobi theory, from another perspective). Besides, as I mentioned: even this 
restricted body of theory provides plenty of material for philosophical analysis — witness 
my examples above, and the discussions of the great figures such Euler, Lagrange, 
Hamilton and Jacobi. 

However, the second half of the twentieth century saw a renaissance in research in 
classical mechanics: hence my first motto. There are four obvious reasons for this: the 
first two "academic" , and the second two "practical" . 

(i) : Thanks partly to developments in mathematics in the decades after Hilbert's 
list of problems, the foundational questions were addressed afresh, as much by math- 
ematicians and mathematically-minded engineers as by physicists. The most relevant 
developments lay in such fields as topology, differential geometry, measure theory and 
functional analysis. In this revival, the contributions of the Soviet school, always strong 
in mechanics and probability, were second to none. And relatedly: — 

(ii) : The quest to deepen the formulation of quantum theory, especially quantum 
field theory, prompted investigation of (a) the structure of classical mechanics and (b) 
quantization. For both (a) and (b), special interest attaches to the generally much 
harder case of infinite systems. 

(iii) : The coming of spaceflight, which spurred the development of celestial mechan- 
ics. And relatedly: — 

(iv) : The study of non-linear dynamics ("chaos theory"), which was spurred by the 
invention of computers. 

With these diverse causes and aspects, this renaissance continues to flourish — and 
accordingly, I shall duck out of trying to further adumbrate it! I shall even duck out 
of trying to survey the philosophical questions that arise from the various formulations 
of mechanics from Newton to Jacobi and Poincare. Suffice it to say here that to the 
various topics mentioned above, one could add, for example, the following two: the 
flrst broadly ontological, the second broadly epistemological. 

(a) : The analysis of notions such as mass and force (including how they change 
over time). For this topic, older books include Jammer (1957, 1961) and McMuUin 
(1978); recent books include Boudri (2002), Jammer (2000), Lutzen (2005) and Slovik 
(2002); and Grattan-Guinness (2006) is a flne recent synopsis of the history, with many 
references. 

(b) : The analysis of what it is to have an explicit solution of a mechanical problem 
(including how the notion of explicit solution was gradually generalized) . This topic is 
multi-faceted. It not only relates to the gradual generalization of the notion of function 
(a grand theme in the history of mathematics — well surveyed by Lutzen 2003), and 
to modern non-linear dynamics (cf. (iv) above). It also relates to the simpliflcation 
of problems by exploiting a symmetry so as to reduce the number of variables one 



needs — and this is the core idea of symplectic reduction. I turn to introducing it. 
1.2 Prospectus 

The strategy of simphfying a mechanical problem by exploiting a symmetry so as 
to reduce the number of variables is one of classical mechanics' grand themes. It is 
theoretically deep, practically important, and recurrent in the history of the subject. 
The best-known general theorem about the strategy is undoubtedly Noether's theo- 
rem, which describes a correspondence between continuous symmetries and conserved 
quantities. There is both a Lagrangian and a Hamiltonian version of this theorem, 
though for historical reasons the name 'Noether's theorem' is more strongly attached 
to the Lagrangian version. However, we shall only need the Hamiltonian version of the 
theorem: it will be the "springboard" for our exposition of symplectic reduction.^ 

So I shall begin by briefly reviewing the Hamiltonian version in Section 12.11 For 
the moment, suffice it to make four comments (in ascending order of importance for 
what follows): 

(i) : Both versions are underpinned by the theorems in elementary Lagrangian and 
Hamiltonian mechanics about cyclic (ignorable) coordinates and their corresponding 
conserved momenta.^ 

(ii) : In fact, the Hamiltonian version of the theorem is stronger. This reflects the 
fact that the canonical transformations form a "larger" group than the point trans- 
formations. A bit more precisely: though the point transformations g — g' on the 
configuration space Q induce canonical transformations on the phase space F of the gs 
and ps, q ^ q',p ^ p' , there are yet other canonical transformations which "mix" the 
gs and ps in ways that transformations induced by point transformations do not. 

(iii) : I shall limit our discussion to (a) time- independent Hamiltonians and (b) 
time-independent transformations. Agreed, analytical mechanics can be developed, in 
both Lagrangian and Hamiltonian frameworks, while allowing time-dependent dynam- 
ics and transformations. For example, in the Lagrangian framework, allowing velocity- 

^For discussion of the Lagrangian version, cf. e.g. Brading and Castellani (this vol., ch. 13) 
or (restricted to finite-dimensional systems) Butterfield (2004a: Section 4.7). For an exposition of 
both versions that is complementary to this paper (and restricted to finite-dimensional systems), cf. 
Butterfield (2006). Brading and Castellani also bring out that, even apart from Noether's theorems 
in other branches of mathematics, there are other 'Noether's theorems' about symmetries in classical 
dynamics; so the present theorem is sometimes called Noether's "first theorem". Note also (though I 
shall not develop this point) that symplectic structure can be seen in the classical solution space of 
Lagrange's equations, so that symplectic reduction can be developed in the Lagrangian framework; 
cf. e.g. Marsden and Ratiu (1999: p. 10, Sections 7.2-7.5, and 13.5). 

^Here we glimpse the long history of our subject: these theorems were of course clear to these 
subjects' founders. Indeed the strategy of exploiting a symmetry to reduce the number of variables 
occurs already in 1687, in Newton's solution of the Kepler problem; (or more generally, the problem 
of two bodies exerting equal and opposite forces along the line between them). The symmetries are 
translations and rotations, and the corresponding conserved quantities are the linear and angular 
momenta. In what follows, these symmetries and quantities will provide us with several examples. 



7 



dependent potentials and-or time-dependent constraints would prompt one to use what 
is often called the 'extended configuration space' Q x IR. And in the Hamiltonian frame- 
work, time-dependence prompts one to use an 'extended phase space' F x IR. Besides, 
from a philosophical viewpoint, it is important to consider time-dependent transfor- 
mations: for they include boosts, which are central to the philosophical discussion of 
spacetime symmetry groups, and especially of relativity principles. But beware: rough- 
and-ready statements about symmetry, e.g. that the Hamiltonian must be invariant 
under a symmetry transformation, are liable to stumble on these transformations. To 
give the simplest example: the Hamiltonian of a free particle is just its kinetic energy, 
which can be made zero by transforming to the particle's rest frame; i.e. it is not 
invariant under boosts. 

So a full treatment of symmetry in Hamiltonian mechanics, and thereby of sym- 
plectic reduction, needs to treat time-dependent transformations — and to beware! But 
I will set aside all these complications. Here it must suffice to assert, without any 
details, that the modern theory of symplectic reduction does cope with boosts; and 
more generally, with time-dependent dynamics and transformations. 

(iv): As we shall see in detail, there are three main ways in which the theory of 
symplectic reduction generalizes Noether's theorem. As one might expect, these three 
ways are intimately related to one another. 

(a) : Noether's theorem is "one-dimensional" in the sense that for each sym- 
metry (a vector field of a special kind on the phase space), it provides a conserved 
quantity, i.e. a real-valued function on the phase space, whose value stays constant 
over time. So in particular, different components of a conserved vector quantity, such 
as total linear momentum, are treated separately; (in this example, the corresponding 
vector fields generate translations in three different spatial directions). But in sym- 
plectic reduction, the notion of a momentum map provides a "unified" description of 
these different components. 

(b) : Given a symmetry, Noether's theorem enables us to confine our attention 
to the level surface of the conserved quantity, i.e. the sub-manifold of phase space on 
which the quantity takes its initial value: for the system's time-evolution is confined 
to that surface. In that sense, the number of variables we need to consider is reduced. 
But in symplectic reduction, we go further and form a quotient space from the phase 
space. That is, in the jargon of logic: we define on phase space an equivalence relation 
(not in general so simple as having a common value for a conserved quantity) and form 
the set of equivalence classes. In the jargon of group actions: we form the set of orbits. 
Passage to this quotient space can have various good technical, and even philosophi- 
cal, motivations. And under good conditions, this set is itself a manifold with lower 
dimension. 

(c) : Hamiltonian mechanics, and so Noether's theorem, is usually formulated 
in terms of symplectic manifolds, in particular the cotangent bundle T*Q of the con- 
figuration space Q. f Section l2.ll will give details.) But in symplectic reduction, we 
often need a (mild) generalization of the idea of a symplectic manifold, called a Pois- 
son manifold, in which a bracket, with some of the properties of the Poisson bracket. 
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is taken as the primitive notion. Besides, this is related to (b) in that we are often 
led to a Poisson manifold, and dynamics on it, by taking the quotient of a symplectic 
manifold (i.e. a phase space of the usual kind) by the action of a symmetry group. 

As comment (iv) hints, symplectic reduction is a large subject. So there are several 
motivations for expounding it. As regards physics, many of the ideas and results can 
be developed for finite-dimensional classical systems (to which I will confine myself), 
but then generalized to infinite-dimensional systems. And in either the original or the 
generalized form, they underpin developments in quantum theories. So these ideas and 
results have formed part of the contemporary renaissance in classical mechanics; cf. (i) 
and (ii) at the end of Section 11.11 

As regards philosophy, symmetry is both a long-established focus for philosophi- 
cal discussion, and a currently active one: cf. Brading and Castellani (2003). But 
philosophical discussion of symplectic reduction seems to have begun only recently, 
especially in some papers of Belot and Earman. This delay is presumably because the 
technical material is more sophisticated: indeed, the theory of symplectic reduction 
was cast in its current general form only in the 1970s. But as Belot and Earman em- 
phasise, the philosophical benefits are worth the price of learning the technicalities. 
The most obvious issue is that symplectic reduction's device of quotienting a state 
space casts light on philosophical issues about whether two apparently distinct but 
utterly indiscernible possibilities should be ruled to be one and the same. In Section 
121 I will follow Belot in illustrating this issue with "relationist" mechanics. Indeed, I 
have selected the topics for my exposition with an eye to giving philosophical readers 
the background needed for some of Belot 's discussions. His papers (which I will cite 
in Section 121) make many judicious philosophical points, without burdening the reader 
with an exposition of technicalities: excellent stuff — but to fully appreciate the issues, 
one of course has to slog through the details. 

Finally, in the context of this volume, symplectic reduction provides some back- 
ground for the Chapters on the representation of time in mechanics (Belot, this vol., 
ch. 2), and on the relations between classical and quantum physics (Landsman, this 
vol., ch. 5, especially Sections 4.3-4.5 and 6.5; Dickson, this vol., ch. 4). 

The plan of the Chapter is as follows. I first review Noether's theorem in Hamil- 
tonian mechanics as usually formulated, in Section 12.11 Then I introduce the themes 
mentioned in (b) and (c) above, of quotienting a phase space, and Poisson manifolds 
f Section 12. 2|) : and illustrate these themes with "relationist" mechanics f Section 12. 3|) . 

Thereafter, I expound the basics of symplectic reduction: (confining myself to finite- 
dimensional Hamiltonian mechanics). Section by Section, the plan will be as follows. 
Sections El and m review the modern geometry that will be needed. Section El is mostly 
about Frobenius' theorem. Lie algebras and Lie groups.'^ Section HI expounds Lie group 
actions. It ends with the central idea of the co-adjoint representation of a Lie group 
G on the dual q* of its Lie algebra. This review enables us to better understand the 

"^Its first two Subsections also provide some pre-requisites for Malament (this vol.). 
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motivations for Poisson manifolds ()5.1|) : and then to exhibit examples, and prove some 
main properties fSection l5.2l onwards). Section El applies this material to symmetry and 
conservation in mechanical systems. In particular, it expresses conserved quantities as 
momentum maps, and proves Noether's theorem for Hamiltonian mechanics on Poisson 
manifolds. Finally, in Section [71 we prove one of the several main theorems about 
symplectic reduction. It concerns the case where the natural configuration space for 
a system is itself a Lie group G: this occurs both for the rigid body and ideal fluids. 
In this case, quotienting the natural phase space (the cotangent bundle on G) gives a 
Poisson manifold that "is" the dual q* of G's Lie algebra.^ 

To sum up: — The overall effect of this exposition is, I hope, to illustrate this 
Chapter's mottoes: that classical mechanics is alive and kicking, not least through 
deepening our understanding of time-honoured systems such as the rigid body — whose 
analysis in traditional textbooks can be all too confusing! 

2 Symplectic reduction: an overview 

We begin by briefly reviewing Hamiltonian mechanics and Noether's theorem, in Sec- 
tion EUl^ This prepares us for the idea of symplectic reduction. Section IT^ which we 
then illustrate using "relationist" mechanics. Section 12.31 

2.1 Hamiltonian mechanics and Noether's theorem: a review 

2.1.1 Symplectic manifolds; the cotangent bundle as a symplectic manifold 

A symplectic structure or symplectic form on a manifold M is defined to be a differential 
2-form uj ow M that is closed (i.e. its exterior derivative du; vanishes) and is non- 
degenerate. That is: for any x G M, and any two tangent vectors at x, o", r G T^: 

dcu = and V r ^ 0, 3o- : cu(r,a)^0 . (2.1) 

Such a pair (M, uo) is called a symplectic manifold. There is a rich theory of symplectic 
manifolds; but we shall only need a small fragment of it. (In particular, the fact that 
we mostly avoid the theory of canonical transformations means we will not need the 
theory of Lagrangian sub- manifolds.) 

^In this endeavour, my sources are four books by masters of the subject: Abraham and Marsden 
(1978), Arnold (1989), Marsden and Ratiu (1999) and Olver (2000). But again, be warned: my 
selection is severe, as anyone acquainted with these or similar books will recognize. 

^For more details about differential geometry, cf. Sections 13.11 and 13.21 For more details about 
the geometric formulation of mechanics, cf. Arnold (1989) or Marsden and Ratiu (1999); or Singer 
(2001) (more elementary than this exposition) or Abraham and Marsden (1978) (more advanced); or 
Butterfield (2006) (at the same level). Of many good textbooks of mechanics, I admire especially 
Desloge (1982) and Johns (2005). 
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First, it follows from the non-degeneracy of uj that M is even- dimensional. The 
reason lies in a theorem of linear algebra, which one then applies to the tangent space 
at each point of M. Namely, for any bilinear form u; : K x — > H: if is antisymmetric 
of rank r < m = dim(V^), then r is even. That is: r = 2n for some integer n, and there 
is a basis ei, Cj, of for which u has a simple expansion as wedge-products 

u = E^^ie'Ae'+" ; (2.2) 

equivalently, u has the m x m matrix 



w = -1 . (2.3) 




where 1 is the nxn identity matrix, and similarly for the zero matrices of various sizes. 
This normal form of antisymmetric bilinear forms is an analogue of the Gram-Schmidt 
theorem that an inner product space has an orthonormal basis, and is proved by an 
analogous argument. 

So if an antisymmetric bilinear form is non-degenerate, then r = 2n = m. That is: 
eq. 12.31 loses its bottom row and right column consisting of zero matrices, and reduces 
to the 2n x 2n symplectic matrix uo given by 



Second, the non-degeneracy of uj implies that at any x e M, there is a basis- 
independent isomorphism uJ' from the tangent space T^. to its dual T*. Namely: for 
any x E M and r E T^, the value of the 1-form uj^{t) G T* is defined by 

cu^(r)(a) ■.= uj{a,T) \/a eT^ . (2.5) 

This also means that a symplectic structure enables a covector field, i.e. a differential 
one-form, to determine a vector field. Thus for any function : M — IR, so that dH 
is a differential 1-form on M, the inverse of iJ" (which we might write as a;^), carries 
dH to a vector field on M, written Xh- This is the key idea whereby in Hamiltonian 
mechanics, a scalar function H determines a dynamics; cf. Section r2. 1.21 

So far, we have noted some implications of uj being non-degenerate. The other part 
of the definition of a symplectic form (for a manifold), viz. uj being closed, duj = 0, is 
also important. We shall see in Section [2.1.31 that it implies that a vector field X on a 
symplectic manifold M preserves the symplectic form uj (i.e. in more physical jargon: 
generates (a one-parameter family of) canonical transformations) iff X is Hamiltonian 
in the sense that there is a scalar function / such that X = Xf = uj'^{df). Or in terms 
of the Poisson bracket, with • representing the argument place for a scalar function: 
X(-)=X;(.) = {-,/}. 
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So much by way of introducing symplectic manifolds. I turn to showing that any 
cotangent bundle T*Q is such a manifold. That is: it has, independently of a choice of 
coordinates or bases, a symplectic structure. 

Given a manifold Q (dim(Q)=n) which we think of as the system's configuration 
space, choose any local coordinate system g on Q , and the natural local coordinates 
g,p thereby induced on T*Q. We define the 2-form 

dp A dq := dpi A dq' := T.'^^^dpi A dq' . (2.6) 

In fact, eq. 12.61 defines the same 2-form, whatever choice we make of the chart q on Q. 
For dpAdq is the exterior derivative of a 1-form on T*Q which is defined naturally (i.e. 
independently of coordinates or bases) from the derivative (also known as: tangent) 
map of the projection 

TV : {q,p) eT*Q^ qeQ. (2.7) 

Thus consider a tangent vector r (not to Q, but) to the cotangent bundle T*Q at a 
point f] = {q,p) G T*Q, i.e. q E Q and p G T*. Let us write this as: r G Tr^{T*Q) = 
T(^q^p)(T*Q). The derivative map, Dn say, of the natural projection vr applies to r: 

Dn-.re T(,,p)(T*g) ^ iDn{r)) G T, . (2.8) 

Now define a 1-form 6h on T*Q by 

: r G T(,,p)(T*Q) ^ p{D7t{t)) G 1R ; (2.9) 

where in this definition of 6^, p is defined to be the second component of r's base-point 
{q,p) G T*Q; i.e. r G T(,,p)(T*Q) and p G T;. 

This 1-form is called the canonical 1-form on T*Q. One now checks that in any 
natural local coordinates q, p, 6h is given by 

OH=Pidq\ (2.10) 

Finally, we define a 2-form by taking the exterior derivative of 6h'- 

d{9H) ■■= d{pidq') = dpi A dq' . (2.11) 

One checks that this 2-form is closed (since = 0) and non-degenerate. So (T*Q, (1(9^)) 
is a symplectic manifold. Accordingly, d{6H), or its negative — d(6'//), is called the 

canonical symplectic form, or canonical 2-form. 

There is a theorem (Darboux's theorem) to the effect that locally, any symplectic 
manifold "looks like" a cotangent bundle: or in other words, a cotangent bundle is 
locally a "universal" example of symplectic structure. We will not go into details; but 
in Section 15. 3. 41 we will discuss the generalization of this theorem for Poisson manifolds. 
But first we review, in the next two Subsections, Hamilton's equations, and Noether's 
theorem. 
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2.1.2 Geometric formulations of Hamilton's equations 



As we already emphasised, the main geometric idea behind Hamilton's equations is that 
a gradient, i.e. covector, field dH determines a vector field Xh- So to give a geometric 
formulation of Hamilton's equations at a point x = {q,p) in a cotangent bundle T*Q, 
let us write cu" for the (basis-independent) isomorphism from the cotangent space to 
the tangent space, T* —>■ T^, induced by u := — d(^^i^) = rfg* A dpi (cf eq. 12. Sj) . Then 
Hamilton's equations may be written as: 

X = Xh{x) = J{dH{x)) = J{dH{x)) . (2.12) 

There are various other formulations. Applying u^, the inverse isomorphism ^ T*, 
to both sides, we get 

u^Xnix) = dH{x) . (2.13) 
In terms of the symplectic form at x, this is: for all vectors t & 

uj{Xh{x),t) =dH{x)-T ; (2.14) 

or in terms of the contraction (also known as: interior product) ixa of a differential 
form a with a vector field X, with ■ marking the argument place of r G T^.: 

ix^u := uiXnix), ■) = dH{x)i-) . (2.15) 

More briefiy, and now written for any function /, it is: 

ixfu; = df . (2.16) 

Finally, recall the relation between the Poisson bracket and the directional derivative 
(or the Lie derivative C) of a function: viz. 

Cx,9 = dg{Xj) = Xf{g) = {g, /} . (2.17) 

Combining this with eq. 12.161 we can state the relation between the symplectic form 
and Poisson bracket in the form: 

{gj} = dg{Xf) = ixjdg = ix,(ix,^) = ^(X„X^) . (2.18) 
2.1.3 Noether's theorem 

The core idea of Noether's theorem, in both the Lagrangian and Hamiltonian frame- 
works, is that to every continuous symmetry of the system there corresponds a con- 
served quantity (a first integral, a constant of the motion). The idea of a continuous 
symmetry is made precise along the following lines: a symmetry is a vector field on 
the state-space that (i) preserves the Lagrangian (respectively, Hamiltonian) and (ii) 
"respects" the structure of the state-space. 

1"? 



In the Hamiltonian framework, the heart of the proof is a "one-hner" based on the 
fact that the Poisson bracket is antisymmetric. Thus for any scalar functions / and H 
on a symplectic manifold (M, cj) (and so with a Poisson bracket given by eq. I2.18|) . we 
have that at any point x G M 

Xf{H){x) = {HJ}{x)=Q iff Q = {f,H}{x)=XHU){^) . (2.19) 

In words: around x, H is constant under the flow of the vector field Xf (i.e. under 
what the evolution would be if / was the Hamiltonian) iff / is constant under the flow 
Xh. Thinking of H as the physical Hamiltonian, so that Xh represents the real time- 
evolution (sometimes called: the dynamical flow), this means: around x, Xf preserves 
the Hamiltonian iff / is constant under time-evolution, i.e. / is a conserved quantity 
(a constant of the motion). 

But we need to be careful about clause (ii) above: the idea that a vector field re- 
spects" the structure of the state-space. In the Hamiltonian framework, this is made 
precise as preserving the symplectic form. Thus we define a vector field X on a symplec- 
tic manifold (M, uj) to be symplectic (also known as: canonical) iff the Lie-derivative 
along X of the symplectic form vanishes, i.e. Cx^ = 0. (This definition is equivalent 
to X's generating (active) canonical transformations, and to its preserving the Poisson 
bracket. But I will not go into details about these equivalences: for they belong to the 
theory of canonical transformations, which, as mentioned, I will not need to develop.) 

We also define a Hamilton system to be a triple {M,u,H) where (M, a;) is a sym- 
plectic manifold and H : M —>■ ]R, i.e. M G J-'{M). And then we define a (continuous) 
symmetry of a Hamiltonian system to be a vector field X on M that: 

(i) preserves the Hamiltonian function, CxH = 0; and 

(ii) preserves the symplectic form, Cx^^ = 0. 

These definitions mean that to prove Noether's theorem from eq. 12.191 it will suffice 
to prove that a vector field X is symplectic iff it is locally of the form Xf. Such a vector 
field is called locally Hamiltonian. (And a vector field is called Hamiltonian if there is 
a global scalar / : M — > IR such that X = Xf.) In fact, two results from the theory of 
differential forms, the Poincare Lemma and Cartan's magic formula, make it easy to 
prove this; (for a vector field on any symplectic manifold {M,uj), i.e. {M,uj) does not 
need to be a cotangent bundle). 

Again writing d for the exterior derivative, we recall that a fc-form a is called: 

(i) : exactif there is a (fc — l)-form (3 such that a = d/3; (cf. the elementary definition 
of an exact differential); 

(ii) : closed if da = 0. 

The Poincare Lemma states that every closed form is locally exact. To be precise: for 
any open set U of M, we define the vector space Q^{U) of fc-form fields on U. Then 
the Poincare Lemma states that if a G fi'^(M) is closed, then at every x G M there is 
a neighbourhood U such that a It/ G Q''{U) is exact. 

Cartan's magic formula is a useful formula (proved by straightforward calculation) 
relating the Lie derivative, contraction and the exterior derivative. It says that if X 
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is a vector field and a a fc-form on a manifold M, then the Lie derivative of a with 
respect to X (i.e. along the flow of X) is 

Cxa = dixd + ixda . (2.20) 

We now argue as follows. Since uj is closed, i.e. da; = 0, Cartan's magic formula, 
eq. 12.201 applied to u becomes 

Cxuj = dix^ + ixduj = dix^ ■ (2-21) 

So for X to be symplectic is for ix^ to be closed. But by the Poincare Lemma, if ix^ 
closed, it is locally exact. That is: there locally exists a scalar function / : M — ^ IR 
such that 

ixuj = df i.e. X = Xf . (2.22) 
So for X to be symplectic is equivalent to X being locally Hamiltonian. 
Thus we have 

Noether's theorem for a Hamilton system If X is a symmetry of 
a Hamiltonian system {M,uj,H), then locally X = Xf] so by the anti- 
symmetry of the Poisson bracket, eq. I2.19[ / is a constant of the motion. 
And conversely: if / : M — IR is a constant of the motion, then Xf is a 
symmetry. 

We will see in Section 16.21 that most of this approach to Noether's theorem, in 
particular the "one-liner" appeal to the anti-symmetry of the Poisson bracket, eq. 12.191 
carries over to the more general framework of Poisson manifolds. For the moment, we 
mention an example (which we will also return to). 

For most Hamiltonian systems in euclidean space IR'^, spatial translations and ro- 
tations are (continuous) symmetries. Let us consider in particular a system we will 
discuss in more detail in Section ITSl N point-particles interacting by Newtonian grav- 
ity. The Hamiltonian is a sum of two terms, which are each individually invariant 
under translations and rotations: 

(i) a kinetic energy term K; though I will not go into details, it is in fact defined 
by the euclidean metric of IR'^, and is thereby invariant; and 

(ii) a potential energy term V; it depends only on the particles' relative distances, 
and is thereby invariant. 

The corresponding conserved quantities are the total linear and angular momen- 
tum.^ 

'^By the way, this Hamiltonian is not invariant under boosts. But as I said in (iii) of Section [1.21 I 
restrict myself to time- independent transformations; the treatment of symmetries that "represent the 
relativity of motion" needs separate discussion. 



2.2 The road ahead 



In this Subsection, four comments will expand on the introductory comment (iv) of 
Section 11.21 and also give some information about the history of symplectic reduction 
and about some crucial examples. 

(1): Generalizing from Noether's theorem; Poisson manifolds: — 
Noether's theorem tells us that a continuous symmetry, i.e. a one-parameter group of 
symmetries, determines a first integral (i.e. a constant of the motion). So a larger group 
of symmetries, i.e. a group with several parameters, implies several first integrals. The 
phase flow is therefore confined to the intersection of the level surfaces of these integrals: 
an intersection which is in general a manifold. In other words: the simultaneous level 
manifold of these integrals is an invariant manifold of the phase flow. 

It turns out that, in many useful cases, this manifold is also invariant under an 
appropriately chosen subgroup of the group of symmetries; and that the quotient space, 
i.e. the set of orbits under the action of this subgroup, is a manifold with a natural 
structure induced by the original Hamiltonian system that is sufficient to do mechanics 
in Hamiltonian style. The quotient space is therefore called the 'reduced phase space'. 

But in some cases, this natural structure is not a symplectic form, but a (mild) 
generalization in which the the form is allowed to be degenerate; i.e. like eq. 12.31 
rather than eq. 12.41 A manifold equipped with such a structure need not be a quotient 
manifold. It can instead be defined in terms of a generalization of the usual Poisson 
bracket, as defined in terms of the symplectic form by eq. 12.181 

The key idea is to postulate a bracket, acting on the scalar functions F : M — IR on 
any manifold M, and possessing four properties enjoyed by the usual Poisson bracket. 
One of the properties is anti-symmetry, emphasised in Section l!^.1.3[ s proof of Noether's 
theorem. The other three are that the postulated bracket, again written {, }, is: to be 
bilinear; to obey the Jacobi identity for any real functions F, G, H on M, i.e. 

{{F, H}, G} + {{G, F}, H} + {{H, G}, F} = ; (2.23) 

and to obey Leibniz' rule for products, i.e. 

{F,H-G} = {F,H}-G + H ■ {F, G} . (2.24) 

We will see in Section |S] that such a bracket, again called 'Poisson bracket', provides 
a sufficient framework for mechanics in Hamiltonian style. In particular, it induces an 
anti-symmetric bilinear form that may be degenerate, as in eq. 12.31 A manifold M 
equipped with such a bracket is called a Poisson manifold. 

The allowance of degeneracy means that a Poisson manifold can have odd dimension; 
while we saw in Section l!^.l.ll that any symplectic manifold is even-dimensional. On the 
other hand, this generalized Hamiltonian mechanics will have clear connections with 
the usual formulation of Section 12.11 The main connection will be the result that any 
Poisson manifold M is a disjoint union of even-dimensional manifolds, on which M's 
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degenerate antisymmetric bilinear form restricts to be non-degenerate.^ 

(2) : Historical roots: — 

The theory of symplectic reduction has deep historical roots in the work of classical me- 
chanics' monumental figures. In part, this is no surprise. As mentioned in (i) of Section 
11.21 cyclic coordinates underpin the role of symmetry in mechanics, and in particular 
Noether's theorem. And Newton's solution of the Kepler problem provides an exam- 
ple: witness textbooks' expositions of the transition to centre-of-mass coordinates, and 
of polar coordinates with the angle being cyclic (yielding angular momentum as the 
conserved quantity). So it is unsurprising that various results and ideas of symplectic 
reduction can be seen in the work of such masters as Euler, Lagrange, Hamilton, Ja- 
cobi. Lie and Poincare; for example (as we will see), in Euler's theory of the rigid body. 

But the history also holds a surprise. It turns out that Lie's epoch-making work on 
Lie groups already contained a detailed development of much of the general, modern 
theory.^ The sad irony is that most of Lie's insights were not taken up — and were 
then repeatedly re-discovered. So this is yet another example (of which the history of 
mathematics has so many!) of the saying that he who does not learn from history is 
doomed to repeat it. The consolation is of course that it is often easier, and more fun, 
to re-discover something than to learn it from another... 

Thus it was only from the mid-1960s that the theory, in essentially the form Lie 
had, was recovered and cast in the geometric language adopted by modern mechanics; 
namely, by contemporary masters such as Arnold, Kostant, Marsden, Meyer, Smale, 
Souriau and Weinstein; (cf. this Chapter's first motto). Happily, several of these mod- 
ern authors are scholars of the history, and even their textbooks give some historical 
details: cf. Marsden and Ratiu (1999, pp. 336-8, 369-370, 430-432), and the notes to 
each Chapter of Olver (2000: especially p.427-428). (Hawkins (2000) is a full history 
of Lie groups from 1869 to 1926; for Lie, cf. especially its Sections 1.3, 2.5 and Chapter 
3, especially 3.2.) 

In any case, setting history aside: symplectic reduction has continued since the 
1970s to be an active research area in contemporary mechanics, and allied fields such 
as symplectic geometry. So it has now taken its rightful place as a major part of the 
contemporary renaissance of classical mechanics: as shown by ... 

(3) : Two examples: the rigid body and the ideal fluid: — 

Two examples illustrate vividly how symplectic reduction can give new physical un- 
derstanding, even of time-honoured examples: the rigid body and the ideal fluid — as 
attested by this Chapter's mottoes. (Section 12.31 will develop a third example, more 
closely related to philosophy.) 

As to the rigid body: we will see (especially in Section that symplectic reduction 
considerably clarifies the elementary theory of the rigid body (Euler's equations, Euler 

^Because of these clear connections, it is natural to still call the more general framework 'Hamil- 
tonian'; as is usually done. But of course this is just a verbal matter. 

^The main source is his (1890). Besides, Arnold (1989: 456) reports that the prototype example 
of a Poisson manifold, viz. the dual of a finite-dimensional Lie algebra, was already understood by 
Jacobi. 
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angles etc.): which, notoriously, can be all too confusing! For simplicity, I shall take 
the rigid body to be pivoted, so as to set aside translational motion. This will mean 
that the group of symmetries defining the quotienting procedure will be the rotation 
group. It will also mean that the rigid body's configuration space is given by the 
rotation group, since any configuration can be labelled by the rotation that obtains it 
from some reference-configuration. So in this application of symplectic reduction, the 
symmetry group (viz. the rotation group) will act on itself as the configuration space. 
This example will also give us our prototype example of a Poisson manifold. 

As to the ideal fluid, i.e. a fluid that is incompressible and inviscid (with zero 
viscosity): this is of course an infinite-dimensional system, and so (as I announced in 
Section II. 2^ outside the scope of this Chapter. So I will not go into any details, but 
just report the main idea. 

The equations of motion of an ideal fluid, Euler's equations, are usually derived 
either by applying Newton's second law F = ma to a small fluid element; or by a 
heuristic use of the Lagrangian or Hamiltonian approach (as in heuristic classical field 
theories). But in the mid-1960s, Arnold showed how the latter derivations could be 
understood in terms of a striking, even beautiful, analogy with the above treatment 
of the rigid body. Namely, the analogy shows that the configuration space of the 
fluid is an infinite-dimensional group; as follows. The configuration of an ideal fluid 
confined to some container occupying a volume V C IR^ is an assignment to each 
spatial position x G of an infinitesimal fluid element. Given such an assignment 
as a reference-configuration, any other configuration can be labelled by the volume- 
preserving diffeomorphism d from V to V that carries the reference-configuration to 
the given one, by dragging each fluid element along by d. So given a choice of reference- 
configuration, the fluid's configuration space is given by the infinite-dimensional group 
X> of diffeomorphisms d : V —>■ V: just as the rotation group is the configuration 
space of a (pivoted) rigid body. T> then forms the basis for rigorous Lagrangian and 
Hamiltonian theories of an ideal fluid. 

These theories turn out to have considerable analogies with the Lagrangian and 
Hamiltonian theories of the rigid body, thanks to the fact that in both cases the sym- 
metry group forms the configuration space. In particular, Euler's equations for ideal 
fluids are the analogues of Euler's equations for a rigid body. Besides, these rigorous 
theories of fluids (and symplectic reduction applied to them) are scientifically impor- 
tant: they have yielded various general theorems, and solved previously intractable 
problems. (For more details, cf. Abraham and Marsden (1978: Sections 4.4 and 4.6 
for the rigid body, and 5.5.8 for the ideal fluid), Arnold (1989: Appendix 2:C to 2:F 
for the rigid body, and 2:G to 2:L for the ideal fluid), and Marsden and Ratiu (1999: 
Chapters 1.4 and 15 for the rigid body, and 1.5, p. 266, for the ideal fluid).) 

(4): Philosophical importance: — 
Symplectic reduction is also, I submit, philosophically important; in at least two ways. 
The first way is specific: it illustrates some methodological morals about how classical 
mechanics analyses problems. I develop this theme in (Butterfield 2005). The second 
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way is more general: the theory, or rather various apphcations of it, is directly relevant 
to disputes in the philosophy of space and time, and of mechanics. This relevance 
is recognized in contemporary philosophy of physics. So far as I know, the authors 
who develop these connections in most detail are Belot and Earman. They discuss 
symplectic reduction in connection with such topics as: 

(i) the treatment of symmetries, including gauge symmetries; 

(ii) the dispute between absolute and relationist conceptions of space and time; and 

(iii) the interpretation of classical general relativity (a topic which connects (i) and 
(ii), and bears on heuristics for quantum gravity). 

Thus Belot (1999, 2000, 2001, 2003, 2003a) and Earman (2003) discuss mainly (i) and- 
or (ii); Belot and Earman (2001) discusses (iii). For (i) and (ii), I also recommend 
Wallace (2003). 

But these papers have a demanding pre-requisite: they invoke, but do not expound, 
the theory of symplectic reduction. They also discuss infinite-dimensional systems 
(especially classical electromagnetism and general relativity), without developing finite- 
dimensional examples like the rigid body. Indeed, there is, so far as I know, no article- 
length exposition of the theory which is not unduly forbidding for philosophers. So I 
aim to give such an exposition, to help readers of papers such as those cited. 

As an appetizer for this exposition, I will first (in Section 12. 3^ follow Belot in 
presenting the general features of a finite-dimensional symplectic reduction which has 
vivid philosophical connections, viz. to the absolute vs. relationist debate. This 
example concerns a system of point-particles in Euclidean space, either moving freely 
or interacting by a force such as Newtonian gravity. (The symmetries defining the 
quotienting procedure are given by the Euclidean group of translations and rotations.) 
For philosophers, this will be a good appetizer for symplectic reduction, since it sheds 
considerable light on relationism about space of the sort advocated by Leibniz and 
Mach. 

2.3 Appetizer: Belot on relationist mechanics 
2.3.1 Comparing two quotienting procedures 

In several papers, Belot discusses how symplectic reduction bears on the absolute-vs.- 
relational debate about space. I shall pick out one main theme of his discussions: the 
comparison of a relational classical mechanical theory with what one gets by quoti- 
enting the orthodox absolutist (also called a 'substantivalist') classical mechanics, by 

^°As I said in Section H"^ my material is drawn from the books by Abraham and Marsden, Arnold, 
Marsden and Ratiu, and Olver. More precisely, I will mostly draw on: Abraham and Marsden (1978: 
Sections 3.1-3.3, 4.1-4.3), Arnold (1989: Appendices 2, 5 and 14), Marsden and Ratiu (1999: Chapters 
9-13) and Olver (2000: Chapter 6). And much of what follows — in spirit, and even in letter — is already 
in Lie (1890)! As a (non-philosophical) introduction to symplectic reduction, I also recommend Singer 
(2001). It is at a yet more elementary level than what follows; e.g. it omits Poisson manifolds and 
co-adjoint representations. 
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an appropriate symmetry group. His main contention — which I endorse — is that this 
comparison sheds considerable hght on relationism: on both its motivation, and its 
advantages and disadvantages.^^ 

Belot's overall idea is as follows. Where the relationist admits one possible config- 
uration, as (roughly) a specification of all the distances (and thereby angles) between 
all the parts of matter, the absolutist (or substantivalist) sees an infinity of possibili- 
ties: one for each way the relationist's configuration (a relative configuration) can be 
embedded in the absolute space. This makes it natural to take the relationist to be 
envisaging a mechanics which is some sort of "quotient" of the absolutist's mechanics. 

In particular, on the traditional conception of space as Euclidean (modelled by H^), 
each of the relationist's relative configurations corresponds to an equivalence class of 
absolutist configurations (i.e. embeddings of arrangements of matter into H'^), with 
the members of the class related by spatial translations and rotations, i.e. elements of 
the Euclidean group. In the jargon of group actions, to be developed in Section EJ the 
Euclidean group acts on the set of all absolutist configurations, and a relative configu- 
ration corresponds to an orbit of this action. So it is natural to take the relationist to 
be envisaging a mechanics which quotients this action of the Euclidean group, to get 
a relative configuration space. A relationist mechanics, of Lagrangian or Hamiltonian 
type, is then to be built up on this space of relative configurations. 

But as Belot emphasises, one can instead consider quotienting the absolutist's state- 
space — i.e. in a Hamiltonian framework, the phase space — rather than their configura- 
tion space. Indeed, this is exactly what one does in symplectic reduction. In particular, 
the Euclidean group's action on the absolutist's configuration space, Q say, can be lifted 
to give an action on the cotangent bundle T*Q; which is accordingly called the 'cotan- 
gent lift'. One can then take the quotient, i.e. consider the orbits into which T*Q is 
partitioned by the cotangent lift. 

We thus have two kinds of theories to compare: (i) the relationist theories, built up 
from the relative configuration space; which for the sake of comparison with symplectic 
reduction we take to be Hamiltonian, rather than Lagrangian; (ii) theories obtained 
by quotienting "later", i.e. quotienting the absolutist's cotangent bundle. 

I will now spell out this comparison. But I will not try to summarize Belot's more 
detailed conclusions, about what such a comparison reveals about the advantages and 
disadvantages of relationism. They are admirably subtle, and so defy summary: they 
can mainly be found at his (2000: p. 573-574, 582; 2001: Sections VIII to X). (Rovelh 
(this volume) also discusses relationism.) 

As befits an appetizer, I will also (like Belot) concentrate on as simple a case as 
possible: a mechanics of N point-particles, which is to assume a Euclidean spatial 
geometry. Of course, the absolutist make this assumption by postulating a Euclidean 
space; but for the relationist, the assumption is encoded in constraints relating the 

"The main references are Belot (1999, 2001, 2003: Sections 3.5, 5). Cf. also his (2000: Sections 
4 to 5.3), (2003a: Section 6). Though I recommend all these papers, the closest template for what 
follows is (2001: Section VI et seq.). 
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various inter-particle distances. The main current example of a relationist mechanics 
of such a system is due to Barbour and Bertotti (1982), though they develop it in the 
Lagrangian framework; (to be precise, in terms of Jacobi's principle). Belot also dis- 
cusses other relational theories, including field theories, i.e. theories of infinite systems; 
some of them also due to Barbour, and in a Lagrangian framework. But in this Section 
I only consider point-particles. 

Also, I will also not discuss boosts, though of course the relationist traditionally 
proposes to identify any two absolutist states of motion related by a boost. In terms 
of group actions, this means I will consider quotienting by an action of the euclidean 
group, but not the Galilei group. (Cf. how I set aside time- dependent transformations 
already in (iii) of Section [TT21) I will also postpone to later Sections technical details, 
even when our previous discussion makes them accessible. 

Finally, a warning to avoid later disappointment! The later Sections will not in- 
clude a full analysis of the euclidean group's actions on configuration space and phase 
space, and their quotients. That would involve technicalities going beyond an appe- 
tizer. Instead (as mentioned at the end of Section IT!^ . the material in later Sections 
is chosen so as to lead up to Section [7|s theorem, the Lie-Poisson reduction theorem, 
about quotienting the phase space of a system whose configuration space is a Lie group. 
Further reasons for presenting the material for this theorem will be given in Section 
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2.3.2 The spaces and group actions introduced 

Let us begin by formulating the orthodox absolutist mechanics of point-particles 
interacting by Newtonian gravity, together with the action of the Euclidean group. 

Each point-particle occupies a point of H^, so that the configuration space Q is IR^^: 
dim((5) = 3A^. So the phase space for Hamiltonian mechanics will be the cotangent 
bundle T*Q 3 {q,p): dim(T*Q) = QN. 

The Hamiltonian is a sum of kinetic and potential terms, K and V. K depends 
only on the ps, and V only on the gs. In cartesian coordinates, with i now labelling 
particles i = 1, ...,N rather than degrees of freedom, we have the familiar expressions: 



H{q,p) = K{p)+V{q) with K = , V{q) = G S.^, „ „ (2.25) 



2mi II Qi - ^.j 

where rrii are the masses and G is the gravitational constant. ^^■^'^ 



^^From the broader philosophical perspective, the most significant feature of eq. 12.251 is no doubt 
the fact that the potential is a sum of all the two-body potential energies for the configuration q £ Q: 
there are no many-body interactions. 

^^Incidental remark. In fact, the kinetic energy can be represented by a metric g on the configuration 
space. For Hamiltonian mechanics, this means that the kinetic energy scalar K on the cotangent 
bundle T*Q can be defined by applying Q's metric g to the projections of the momenta p, where at 
each point {q,p) G T*Q the projection is made with the preferred isomorphism : T* — s- Tq\ (cf. eq. 
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The euclidean group E (aka: -^(3)) is the group (under composition) of transla- 
tions, rotations and reflections on IR^. But since we will be interested in continuous 
symmetries, we will ignore reflections, and so consider the subgroup of orientation- 
preserving translations and rotations; i.e. the component of the group connected to 
the identity transformation (which I will also write as E). This is a Lie group, i.e. a 
group which is also a manifold, with the group operations smooth with respect to the 
manifold structure. Section |21 will give formal details. Here we just note that we need 
three real numbers to specify a translation (x = (x, y,z)), and three to specify a rota- 
tion (two for an axis, and one for the angle through which to rotate); and accordingly, 
it is unsurprising that as a manifold, the dimension of is 6: dim(ii^) = 6. 

E acts in the obvious sense on IR^. For example, if g G E is translation by x G H^, 
g induces the map q G IR^ i— q + x. Similarly for a rotation induces: again. Section 121 
will give a formal definition. 

Now let E act in this way on each of the N factor spaces IR^ of our system's 
configuration manifold Q = IR'^^. This defines an action $ on Q: i.e. for all g E E, 
there is a map : Q ^ Q. For example, for g = a translation by x G IR^, we have 

$3 : (qj) = (qi, q^) G Q ^ (qi + X, q^ + x) G g ; (2.27) 

and similarly for rotations. Since the potential function V : Q — IR of eq. I2.25l depends 
only on inter-particle distances, each map $g : Q — > Q is a symmetry of the potential; 
i.e. we have V{^g{q)) = V{q). 

The action $ (i.e. the assignment g E E ^ $g) induces an action of E on T*Q = 
T*1R^^, called the cotangent lift of $ to T*Q, and usually written as $*; so that we 
have for each g E E a. lifted map $* : T*Q T*Q. Again, the details can wait till 
later (Section^. But the idea is that each map $g on Q is smooth, and so maps curves 
to curves, and so vectors to vectors, and so covectors to covectors, and so on. 

Unsurprisingly, each of the lifted maps $* : T*Q —>■ T*Q leaves the potential V , now 
considered as a scalar on T*Q, invariant: i.e. we have \^($*(g,p)) = V{q,p) = V{q). 
But furthermore, each of the lifted maps $* is a symmetry of the Hamilton system, in 
our previous sense (Section |2.1.3|) . That is: $* preserves the Hamiltonian (indeed the 
kinetic and potential terms are separately invariant); and it preserves the symplectic 
structure. This means the dynamics is invariant under the action of all g E G: the 
dynamical histories of the system through {q,p) and through $*(g,p) match exactly 
at each time. They are qualitatively indistinguishable: in contemporary metaphysical 
jargon, they are duplicates. 

At this point, of course, we meet the absolute- vs. -relational debate about space. The 
absolutist asserts, and the relationist denies, that there being two such indistinguishable 
possibilities makes sense. So the relationist, presented with the theory above, says 

[TT^ . That is:— 

K : iq,p) 6 T*Q ^ gg{Jip),Jip)) . (2.26) 
"'^^The locus classicus for this debate is of course the Leibniz- Clarke correspondence, though the 
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we should cut down the space of possibihties. As I said in Section 12.3.11 it is natural 
to make this precise in terms of quotienting the action of the euclidean group: a set of 
absolutist possibilities related one to another by elements of the euclidean group form 
an equivalence class (an orbit) which is to represent one relationist possibility. 

But here we need to distinguish two different quotienting procedures. I will call 
them Relationism and Reductionism (with capital R's), since the former is close to both 
traditional and contemporary relationist proposals, and the latter is an example of the 
orthodox idea of symplectic reduction. As I said in Section 12.3.11 the main difference 
will be that: 

(i) : Relationism performs the quotient on E^s action on the configuration space Q; 
the set of orbits form a relative configuration space, on which the relationist proposes 
to build a dynamics, whether Lagrangian or Hamiltonian — yielding in the latter case, 
a relative phase space; whereas 

(ii) : Reductionism performs the quotient on E's action on the usual phase space 
T*Q, the set of orbits forming a reduced phase space. 

Since our discussion adopts the Hamiltonian framework, it will not matter for what 
follows, that Relationism, as defined, can adopt the Lagrangian framework, while Re- 
ductionism is committed to the Hamiltonian one. What will matter is that (i) and (ii) 
make for phase spaces of different dimensions; the reduced phase space has six more 
dimensions than the relative phase space. The "dimension gap" is six. 

We will see that four of the six variables that describe these dimensions are con- 
stants of the motion; the other two vary with time. And for certain choices of values 
of the constants of the motion (roughly: no rotation), the time- varying variables drop 
out, and the dynamics according to the Reductionist theory simplifies so as to coincide 
with that of the Relationist theory. In other words: if we impose no rotation, then the 
heterodox Relationist dynamics matches the conventional Reduced dynamics. 

2.3.3 The Relationist procedure 

The Relationist seeks a mechanics based on the relative configuration space (RCS). An 
element of the RCS is to be a pattern of inter-particle distances and angles that is 
geometrically possible, i.e. compatible with the N particles being embedded in IR^. 
So, roughly speaking, an element of the RCS is a euchdean configuration, modulo 
isometrics; and the RCS will be the set of orbits IR^^/i?. 

Even before giving a more precise statement, we can state the "punchline" about 
dimensions, as follows. Since dim{E) = 6, quotienting by E subtracts six dimensions: 

protagonists' argumentation is of course sometimes theological. Clarke the absolutist maintains that 
there are many possible arrangements of bits of matter in space consistent with a specification of all 
relative distances, saying 'if [the mere will of God] could in no case act without a pre-determining 
cause ... this would tend to take away all power of choosing, and to introduce fatality.' Leibniz claims 
there is only one such arrangement: 'those two states ... would not at all differ from one another. 
Their difference therefore is only to be found in our chimerical supposition of the reality of space in 
itself.' 
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that is, the dimension of the RCS will be 3A^-6. 

But we need to be more precise about the RCS. For the orbits and quotient spaces 
to be manifolds, and for dimensions to add or subtract in this simple way, we need to 
excise two classes of "special" points from IR'^^, before we quotient. (But I postpone 
till Section m the technical rationale for these excisions.) 

Let 5q C 1R^^ be the set of configurations which are symmetric: i.e. each is fixed 
by some element of E (other than the identity element!). Any configuration in which 
all the point-particles are coUinear provides an example: the configuration is fixed by 
any rotation about the line as axis. Let Ag be the set of collision configurations; i.e. 
configurations in which two or more particles are coincident in the usual configuration 
space IR'^^. (The Q subscripts will later serve as a reminder that these sets are sets 
of configurations.) 5q and Ag are both of measure zero in IR^^. Excise both of them, 
and call the resulting space, which is again of dimension 3A^: Q := IR^^ - {5q U Ag). 

5q and Ag are each closed under the action of E. That is, each is a union of orbits: 
a euclidean transformation of a symmetric (collision) configuration is also symmetric 
(collision). So E acts on Q. Now quotient Q hy E. Q/E is the Relationist's RCS. 
Since dim{E) = 6, we have: dim{Q/E) = 3N-6. 

These 3N — Q variables encode all of a (relative) configuration's particle-pair relative 
distances, Vij G IR (with i,j labelling particles). Note that there are N{N — l)/2 such 
relative distances; and for > 4, this is greater than 3A^ — 6: (for >> 4, it is much 
greater). So the relative distances, though physically intuitive, give an over-complete 
set of coordinates on Q/E. (So they cannot be freely chosen: there are constraints 
between them.) 

So the Relationist seeks a mechanics that uses this RCS. Newton's second law 
being second-order in time means that she will also need quantities like velocities (in 
a Lagrangian framework) or like momenta (in a Hamiltonian framework). For the 
former, she will naturally consider the A^(A^ — l)/2 relative velocities Vij := ^Tij] and 
for the latter, the corresponding momenta pij := Again, she must beware of 

constraints. The tangent and cotangent bundles built on her RCS Q/E will each have 
dimension 2(3A^ - 6) = 6A^ - 12. So again, for A^ > 4, the number A^(A^ - l)/2 of 
relative velocities r^j, or of relative momenta Pij, is greater than the number of degrees 
of freedom concerned; and for A^ >> 4, it is much greater. So again, the relative 
velocities or relative momenta are over-complete: there are constraints. 

On the other hand, if the Relationist uses only these relative quantities, Vij and 
either fij or pij (or "equivalent" coordinates on T{Q/E) or T*{Q/E) that are not over- 
complete), she faces a traditional problem — whatever the other details of her theory. At 
least, she faces a problem if she hopes for a deterministic theory which is empirically 
equivalent to the orthodox absolutist theory. I will follow tradition and state the 
problem in terms of relative velocities rather than momenta. 

The problem concerns rotation; (and herein lies the strength of Newton's and 
Clarke's position in the debate against Leibniz). For according to the absolutist theory 
two systems of point-particles could match with respect to all relative distances and 
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relative velocities, and yet have different future evolutions; so that a theory allowing 
the same possibilities as the absolutist one, yet using only these relative quantities (or 
"equivalent" variables) would have to be indeterministic. 

The simplest example is an analogue for point-particles of Newton's two globes 
thought-experiment. Thus the systems could each consist of just two point-particles 
with zero relative velocity. One system could be non-rotating, so that the point- 
particles fall towards each other under gravity; while the other system could be rotat- 
ing about an axis normal to the line between the particles, and rotating at just such a 
rate as to balance the attractive force of gravity. 

The Relationist has traditionally replied that they do not hope for a theory empir- 
ically equivalent to the absolutist one. Rather, they envisage a mechanics in which, of 
the two systems mentioned, only the non-rotating evolution is possible: more gener- 
ally, a mechanics in which the universe as a whole must have zero angular momentum. 
Originally, in authors like Leibniz and Mach, this reply was a promissory note. But 
modern Relationist theories such as Barbour and Bertotti's (1982) have made good 
the promise; and they have been extended well beyond point-particles interacting by 
Newtonian gravity. Besides, since the universe seems in fact to be non-rotating, these 
theories can even claim to be empirically adequate, at least as regards this principal 
difference from absolutist theories. 

But it is not my brief to go into these theories' details, except by way of comparison 
with a quotiented version of the absolutist theory: cf. Section 12.3.41 

2.3.4 The Reductionist procedure 

The Reductionist's main idea is to quotient only after passing to the orthodox phase 
space for point-particles, i.e. the cotangent bundle T*1R of IR .So the idea is 
to consider {T*JR^^)/E, i.e. the quotient of T*1R^^ by the cotangent-lifted action 
of the euclidean group E. 

More precisely, we again proceed by first excising special points that would give 
technical trouble. But now the points to be excised are in the cotangent bundle T*1R^^, 
not in IR .So let 6 C T*1R be the set of phase space states whose configurations are 
symmetric (in the sense of Section IZ!O s 6q). Let A C T^IR^^ be the set of collision 
points; i.e. states in which two or more particles are coincident in the configuration 
space IR^^. Both 5 and A are of measure zero. Excise both of them, and call the 
resulting phase space, which is again of dimension 6A^: M := T*1R^^ - (5 U A). 

6 and A are each closed under the cotangent-lifted action of E on T*1R^^. That is, 
each is a union of orbits: the cotangent lift of a euclidean transformation acting on a 
phase space state with a symmetric (collision) configuration yields a state which also 

^^An advocate of the absolutist theory might say that it is odd to make what seems a contingent 
feature of the universe, non-rotation, a principle of mechanics; and the Relationist might reply that 
their view has the merit of predicting that the universe does not rotate! I fear there are no clear 
criteria for settling this methodological dispute; anyway, I will not pursue it. 



has a symmetric (collision) configuration. So E acts on M. Now quotient M by E, 
getting M := M/E. This is called reduced phase space. We have: dim(M) = dim(M) 
- dim(E) = 6A^ - 6. 

As emphasised at the end of Section 12.3.21 M has six more dimensions than the 
corresponding Relationist phase space (whether the velocity phase space (tangent bun- 
dle) or the momentum phase space (cotangent bundle)). The dimension of those phase 
spaces is 2(3iV — 6) = 6A^ — 12. Indeed, we can better understand both the reduced 
phase space M and Relationist phase spaces by considering this "dimension gap". 
There are two extended comments to make. 

(1) : Obtaining the Relationist phase space: — 

We can obtain the Relationist momentum phase space from our original phase space 
M. Thus let Mo be the subspace of M in which the system has total linear momentum 
and total angular momentum both equal to zero. Since these are constants of the 
motion, Mq is dynamically closed and so supports a Hamiltonian dynamics given just 
by restriction of the original dynamics. With linear and angular momentum each 
contributing three real numbers, dim(Mo) = dim(M) - 6 = 6A^ — 6. Furthermore, 
Mo is closed under (is a union of orbits under) the cotangent-lifted action of E. So 
let us quotient Mo by this action of E, and write Mq '■= Mo/E. Then dim(Mo) = 
6N - Q - 6 = QN - 12. 

Now recall that this is the dimension of the phase space of the envisaged Relationist 
theory built on the RCS Q/E. And indeed, as one would hope: Mo is the Hamiltonian 
version of Barbour and Bertotti's 1982 Relational theory; (recall that they work in a 
Lagrangian framework). 

That is: Mq is a symplectic manifold, and points in Mo are parametrized by all the 
particle-to-particle relative distances and relative velocities. There is a deterministic 
dynamics which matches that of the original absolutist theory, once the original dy- 
namics is projected down to Section 12231s relative configuration space Q/E. 

In short: the vanishing total linear and angular momenta mean that an initial 
state comprising only relative quantities is sufficient to determine all future relative 
quantities. 

(2) : Decomposing the Reductionist reduced phase space: — 

Let us return to the reduced phase space M. The first point to make is that since 
the Hamiltonian H on M, or indeed on T*IR^^, is invariant under the cotangent- 
lifted action of E, the usual dynamics on M projects down to M = M/E. That is: 
the reduced phase space dynamics captures all the E'-invariant features of the usual 
dynamics. 

In fact, M is a Poisson manifold. So it is our first example of the more general 
framework for Hamiltonian mechanics announced in (1) of Section Again, I post- 
pone technical detail till later (especially Sections l5.ll and I5.2.4|) . But the idea is that 
a Poisson manifold has a degenerate antisymmetric bilinear map, which implies that 
the manifold is a disjoint union of symplectic manifolds. Each symplectic manifold is 
called a leaf oi the Poisson manifold. The leaves' symplectic structures "mesh" with 
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one another; and within each leaf there is a conventional Hamiltonian dynamics. 

Even without a precise definition of a Poisson manifold, we can describe how M 
is decomposed into symplectic manifolds, each with a Hamiltonian dynamics. Recall 
that we have: dim(M) = dim(M) - dim(£') = 6A^ — 6. This breaks down as: 

6A^ - 6 = (6A^ - 12) + 3 + 3 = 2(3A^ -6) + 3 + 3=:a + /? + 7 (2.28) 

where the right hand side defines a,/?, 7 respectively as 2(3A^ — 6), 3 and 3. In terms 
of M, this means the following. 

(i) : a corresponds to (l)'s Mq, i-e. to T*{Q/E). As discussed, 3A^ — 6 variables 
encode all the particle-pair relative distances; and the other 3iV — 6 variables encode 
all the particle-pair relative momenta. 

The six extra variables additional to these QN-12 relative quantities consist of: four 
constants of the motion, and two other variables which are dynamical, i.e. change in 
time. 

(ii) : P stands for three of the four constants of the motion: viz. the three variables 
that encode the total linear momentum of the system, i.e. the momentum of the centre 
of mass. These constants of the motion are "just parameters" in the sense that: (a) 
not only does specifying a value for all three of them fix a surface, i.e. a (6iV — 9)- 
dimensional hypersurface in M, on which there is a Hamiltonian dynamics; also (b) 
this Hamiltonian and symplectic structure is independent of the values we specify. 

(iii) : 7 stands for the three variables that encode the total angular momentum of 
the system. One of these is a fourth constant of the motion, viz. the magnitude L 
of the total angular momentum. The other two time-varying quantities fix a point on 
a sphere (2-sphere) of radius L, encoding the direction of the angular momentum of 
the system in a frame rotating with it. The situation is as in the elementary theory 
of the rigid body: though the total angular momentum relative to coordinates fixed 
in space is a constant of the motion (three constant real numbers), the total angular 
momentum relative to the body is constant only in magnitude (one real number L) , not 
in direction. This will be clearer in Section El onwards, when we describe the Poisson 
manifold structure in the theory of the rigid body. For the moment, there are two main 
comments to make about the particle system: — 

(a): If we specify L, in addition to the momentum of the centre of mass of the 
system, we get a (6A^ — 10)-dimensional hypersurface in M, on which (as in (ii)) there 
is a Hamiltonian dynamics. So we can think of M as consisting of the four real- 
parameter family of these hypersurfaces, with each point of each hypersurface being 
equipped with a sphere of radius L; (subject to a qualification in (b) below). 

Note that here 'each point being equipped' does not mean that the sphere gives 

-'^^As mentioned at the end of Section Fi. 3. II the relationist traditionally proposes to identify abso- 
lutist states of motion that differ just by the value of the total momentum. And indeed, the proposal 
can be implemented by considering an action of the Galilean group on the absolutist phase space 
M, and identifying points related by Galilean boosts. For discussion and references, cf. Belot (2000: 
Section 5.3). 
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the extra dimensions that would constitute M as a fibre bundle; (there would be two 
dimensions lacking). Rather: in the point's representation by 6A^ — 10 real numbers, 
two of the numbers can be taken to represent a point on a sphere. 

(b): But unlike the situation for (3 in (ii) above, the Hamiltonian dynamics on 
such a hypersurface depends on the value of L. In particular, if L = the sphere 
representing the body angular momentum is degenerate: it is of radius zero, and the 
other two time- varying quantities drop out. A point in the hypersurface is represented 
by 6A^ — 12 real numbers; i.e. the hypersurface is 6A^ — 12-dimensional. 

Now recall from Section f2.3.3l or (1) above that 6A^ — 12 is the dimension of the 
phase space of the envisaged Relationist theory built on the RCS Q/E. And indeed, 
just as one would hope: the hypersurface with L = and also with vanishing linear 
momentum, with its dynamics, is the symplectic manifold and dynamics that is the 
Hamiltonian version of Barbour and Bertotti's 1982 Relational theory of point- 
particles. In terms of (l)'s notation, this hypersurface is Mq- 

We can sum up this comparison as follows. On this hypersurface Mq, the dynamics 
in the reduced phase space coincides with the dynamics one obtains for the relative 
variables, if one arbitrarily embeds their initial values in the usual absolutist phase 
space T*1R^^, subject to the constraint that the total angular and linear momenta 
vanish, and then reads off (just by projection) their evolution from the usual evolution 
in T*IR3^. 

2.3.5 Comparing the Relationist and Reductionist procedures 

In comparing the Relationist and Reductionist procedures, I shall just make just two ex- 
tended comments, and refer to Belot for further discussion. The gist of both comments 
is that Reductionism suffices: Relationism is not needed. The first is a commonplace 
point; the second is due to Belot. 

2. 3. 5. A Reductionism allows for rotation The first comment reiterates the Re- 
ductionist's ability, and the Relationist's inability, to endorse Newton's globes (or 
bucket) thought-experiment. The Reductionist can work in either 

(i) the (6A^ — 6)-dimensional phase space M = M/E; or 

(ii) the (6A^ — 9)-dimensional hypersurface got from (i) by specifying the centre of 
mass' linear momentum; or 

(iii) the (6A^ — 10)-dimensional hypersurface got from (ii) by also specifying a non- 
zero value of L. 

In all three cases, the Reductionist can describe rotation in a way that the Relation- 
ist with their (6A^ — 12)-dimensional space cannot. For she has to hand the three extra 
non-relative variables (L and two others) that describe the rotation of the system as a 
whole. (Incidentally: that they describe the system as a whole is suggested by there 
being just three of them, whatever the value of A^.) In particular, she can distinguish 
states of rotation and non-rotation (L = 0), in the sense of endorsing the distinctions 
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advocated by the globes and bucket thought-experiments. 

The Reductionist can also satisfy a traditional motivation for relationism, which 
concerns general philosophy, rather than the theory of motion. It is especially associ- 
ated with Leibniz: namely, our theory (or our metaphysics) should not admit distinct 
but utterly indiscernible possibilities. One might well ask why we should endorse this 
"principle of the identity of indisccrniblcs" for possibilities rather than objects. For 
Leibniz himself, the answer lies (as Belot's (2001) brings out) in his principle of suffi- 
cient reason, and ultimately in theology. 

But in any case the Reductionist can satisfy the requirement. Agreed, the usual 
absolutist theory cast in T*1R^^ (or if you prefer, M = T*1R^^ - (5 U A)) has nine 
variables that describe (i) the position of the centre of mass, (ii) the orientation of the 
system about its centre of mass, and (iii) the system's total linear momentum: i.e. 
three variables, a vector in IR^, for each of (i)-(iii). So the usual absolutist theory has 
a nine-dimensional "profligacy" of distinct but indiscernible possibilities. But as we 
have seen, the Reductionist quotients by the action of the euclidean group E, and so 
works in M = M/E: which removes the proflgacy about (i) and (ii). 

As to (iii), 1 agree that for all I have said, a job remains to be done. The foliation 
of M by a three real-parameter family of {6N — 9)-dimensional hypersurfaces, labelled 
by the system's total linear momentum, codifies the profligacy — but does not eliminate 
it. But as I mentioned above (cf. footnote 16), the Reductionist can in fact quotient 
further, by considering the action of Galilean boosts and identifying phase space points 
that differ by a boost; i.e. defining orbits transverse to these hypersurfaces. 

2.3.5.B Analogous reductions in other theories I close my philosophers' appe- 
tizer for symplectic reduction by summarizing some general remarks of Belot's (2001: 
Sections VIII- IX); cf. also his (2003a, Sections 12, 13). They are about how our dis- 
cussion of relational mechanics is typical of many cases; and how symplectic reduction 
can be physically important. 1 label them (l)-(3). 

(1): A general contrast: when to quotient: — 
The example of point-particles interacting by Newtonian gravity is typical of a 
large class of cases (infinite-dimensional, as well as finite-dimensional). There is a 
configuration space Q, acted on by a continuous group G of symmetries, which lifts to 
the cotangent bundle T*Q, with the cotangent lift leaving invariant the Hamiltonian, 
and so the dynamics. So we can quotient T*Q by G to give a reduced theory. (There 
is a Lagrangian analogue; but as above, we set it aside.) But there is also some 
motivation for quotienting G"s action on Q, irrespective of how we then go on the 
construct dynamics. Let us adopt 'relationism' as a mnemonic label for whatever 
motivates quotienting the configuration space. Then with suitable technical conditions 
assumed (recall our excision of 6 and A), we will have: 

(i) : for the reduced Hamiltonian theory: dim{{T*Q)/G) = 2 dimQ - dimG; 

(ii) : for the relationist theory, in a Lagrangian or Hamiltonian framework: 
dim(T(g/G')) = dim(r*(g/G')) = 2(dimQ - dimC) 



9Q 



So we have in the reduced theory, dim G variables that do not occur in the relationist 
theory: let us call them 'non-relational variables'. 

(2) : The non-relational variables: — 

Typically, these non-relational variables represent global, i.e. collective, properties of 
the system. That is unsurprising since the number, dim G, of these variables is inde- 
pendent of the number of degrees of freedom of the system (dim Q, or 2dim Q if you 
count rate of change degrees of freedom separately). 

Some of these variables are conserved quantities, which arise (by Noether's theo- 
rem) from the symmetries. Furthermore, there can be specific values of the conserved 
quantities, like the vanishing angular momentum of Section 12.3.41 for which the re- 
duced theory collapses into the relationist theory. That is, not only are the relevant 
state spaces of equal dimension; but also their dynamics agree. 

(3) : The reduced theory: — 

Typically, the topology and geometry of the reduced phase space {T*Q)/G, and the 
Hamiltonian function on it, H : {T*Q)/G —>■ IR say, are more complex than the 
corresponding features of the unreduced theory on T*Q. In particular, the reduced 
Hamiltonian H typically has potential energy terms corresponding to forces that are 
absent from the unreduced theory. But this should not be taken as necessarily a defect, 
for two reasons. 

First, there are famous cases in which the reduced theory has a distinctive mo- 
tivation. One example is Hertz' programme in mechanics, viz. to "explain away" 
the apparent forces of our macroscopic experience (e.g. gravity) as arising from re- 
duction of a theory that has suitable symmetries. (The programme envisaged cyclic 
variables for microscopic degrees of freedom that were unknown to us; cf. Lutzen (1995, 
2005).) Another famous example is the Kaluza-Klein treatment of the force exerted on 
a charged particle by the electromagnetic field. That is: the familiar Lorentz force-law 
describing a charged particle's motion in four spacetime dimensions can be shown to 
arise by symplectic reduction from a theory postulating a spacetime with a fifth (tiny 
and closed) spatial dimension, in which the particle undergoes straight-line motion. 
Remarkably, the relevant conserved quantity, viz. momentum along the fifth dimen- 
sion, can be identified with electric charge; so that the theory can claim to explain the 
conservation of electric charge. (This example generalizes to other fields: for details 
and references, cf. Marsden and Ratiu (1999, Section 7.6).) 

Second, the reduced theory need not be so complicated as to be impossible to work 
with. Indeed, these two examples prove this point, since in them the reduced theory 
is entirely tractable: for it is the familiar theory — that one might resist abandoning 
for the sake of the postulated unreduced theory. Besides, Belot describes how, even 
when the reduced theory seems complicated (and not just because it is unfamiliar!), 
the general theory of symplectic reduction, as developed over the last forty years, has 

^^And here one should resist being prejudiced because of famiharity. Why not have Newtonian 
gravity arise from a microscopic cyclic degree of freedom? Why not have the Lorentz force law arise 
from geodesic motion in a five-dimensional spacetime with the fifth dimension wrapped up, so that 
conservation of charge is explained, in Noether's theorem fashion, by a symmetry? 



shown that one can often "do physics" in the reduced phase space: and that, as in the 
Kaluza-Klein example, the physics in the reduced phase space can be heuristically, as 
well as interpretatively, valuable. 

3 Some geometric tools 

So much by way of an appetizer. The rest of the Chapter, comprising this Section 
and the next four, is the five-course banquet! This Section expounds some modern 
differential geometry, especially about Lie algebras and Lie groups. Section HI takes 
up actions by Lie groups. Then Sectional describes Poisson manifolds as a generalized 
framework for Hamiltonian mechanics. As I mentioned in (2) of Section 17!^ Lie himself 
developed this framework; so in effect, he knew everything in these two Sections — so 
it is a true (though painful!) pun to say that these three Sections give us the "Lie of 
the land". In any case, these two Sections will prepare us for Section IHIs description 
of symmetry and conservation in terms of momentum maps. Finally, Section will 
present one of the main theorems about symplectic reduction. It concerns the case 
where the natural configuration space for a system is itself a Lie group G; (cf. (3) of 
Section 12121). Quotienting the natural phase space (the cotangent bundle on G) will 
give a Poisson manifold that "is" the dual of G's Lie algebra. 

In this Section, I first sketch some notions of differential geometry, and fix notation 
f Section 13. Then I introduce Lie algebras and Lie brackets of vector fields (Section 
13. 2|) . Though most of this Section (indeed this Chapter!) is about differential rather 
than integral notions, I will later need Frobenius' theorem, which I present in Section 
13.31 Then I give some basic information about Lie groups and their Lie algebras 
(Section E31)- 

3.1 Vector fields on manifolds 

3.1.1 Manifolds, vectors, curves and derivatives 

By way of fixing ideas and notation, I begin by giving details about some ideas in 
differential geometry (some already used in Section [2.11) . and introducing some new 
notation for them. 

A manifold M will be finite-dimensional, except for obvious and explicit excep- 
tions such as the infinite-dimensional group of diffeomorphisms of a (as usual: finite- 
dimensional!) manifold. I will not be concerned about the degree of differentiability in 
the definition of a manifold, or of any associated geometric objects: 'smooth' can be 
taken throughout what follows to mean C°°. I will often not be concerned with global, 
as against local, structures and results; (though the reduction results we are driving 
towards are global in nature). For example, I will not be concerned about whether 
curves are inextendible, or flows are complete. 
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I shall in general write a vector at a point a; G M as X; or in terms of local 
coordinates x\ as X = X^-£j (summation convention). From now on, I shall write the 
tangent space at a point x G M as T^M (rather than just T^), thus explicitly indicating 
the manifold M to which it is tangent. As before, I write the tangent bundle, consisting 
of the "meshing collection" of these tangent spaces, as TM. Similarly, I write a 1-form 
(covector) at a point x E M as a; and so the cotangent space at a; e M as T*M; and 
as before, the cotangent bundle as T*M. 

A smooth map f : M N between manifolds M and N (maybe N = M) maps 
smooth curves to smooth curves, and so tangent vectors to tangent vectors; and so 
on for 1-forms and higher tensors. It is convenient to write T/, called the derivative 
or tangent of / (also written as or df or -D/), for the induced map on the tangent 
bundle. 

In more detail: let us take a curve c in M to be a smooth map from an interval 
/ C IR to M, and a tangent vector at x G M, X G T^M, to be an equivalence class 
[c]x of curves through x. (The equivalence relation is that the curves be tangent at x, 
with respect to every local chart at x; but I omit the details of this.) Then we define 
Tf -.TM ^ TN (also written : TM TN) by 

U[cl) = T/([c],) := [/ o c]y(,.), for all x e M. (3.1) 

We sometimes write T^f for the restriction of Tf to just the tangent space T^M at x; 
i.e. 

TJ : [cl G T,.M ^ [/ o c]y(,) G T;(,)iV. (3.2) 

In Section I3.1.2I B. we will discuss how one can instead define tangent vectors to be 
differential operators on the set of all scalar functions defined in some neighbourhood 
of the point in question, rather than equivalence classes of curves. One can then define 
the tangent map /* = Tf in a way provably equivalent to that above. 

3.1.2 Vector fields, integral curves and flows 

We will be especially concerned with vector fields defined on M, i.e. X : x & M 
X{x) G TxM, or on a subset U C M. So suppose that X is vector field on M and 
f : M —>■ N is a. smooth map, so that T^f : T^M — > Tf(^x)X . 

3. 1.2. A Push-forwards and pullbacks It is important to note {Txf){X{x)) does 
not in general define a vector field on N . For f{M) may not be all of N, so that for 
y G (X — ran(/)) (Txf){X{x)) assigns no element of TyN. And / may not be injective, 
so that we could have x,x' E M and f{x) = f{x') with (Tr^f){X{x)) ^ (T^//)(X(x')). 
Thus we say that vector fields do not push forward. 

On the other hand, suppose that f : M N is a. diffeomorphism onto N: that 
is, the smooth map / is a bijection, and its inverse f~^ is also smooth. Then for any 
vector field X on M, Tf{X) is a vector field on N. So in this case, the vector field does 
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push forward. Accordingly, Tf{X) is called the push-forward of X; it is often written 
as f*{X). So for any x G M, the pushed forward vector field at the image point f{x) 
is given by 

(/.(X))(/(x)):=r,./-X(x). (3.3) 
(Note the previous use of the asterisk-subscript for the derivative of /, in eq. 13.11 ) 
This prompts three more general comments. 

(1) : More generally: we say that two vector fields, X on M and Y on A^, are /- 
related on M (respectively: on C M) if (T/)(X) = F at all x G M (respectively: 
X e S). 

(2) : We can generalize the idea that a diffeomorphism implies that a vector field can 
be pushed forward, in two ways. First, the diffeomorphism need only be defined locally, 
on some neighbourhood of the point x G M of interest. Second, a diffeomorphism 
establishes a one-one correspondence, not just between vector fields defined on its 
domain and codomain, but also between all differential geometric objects defined on 
its domain and codomain: in particular, 1-form fields, and higher rank tensors. 

(3) : (This continues comment (2).) Though vector fields do not in general push for- 
ward, 1-form fields do in general pull back. This is written with an asterisk-superscript. 
That is: for any smooth f : M —>■ N, not necessarily a diffeomorphism (even locally), 
and any 1-form field (differential 1-form) a on A^, we define the pullback /*(«) to be 
the 1-form on M whose action, for each x G M, and each X G T^M, is given by: 

(r(a))(X):=a|;(.)(T/(X)). (3.4) 

Similarly, of course if the map / is defined only locally on a subset of M: a 1-form 
defined on the range of / pulls back to a 1-form on the domain of /. 

3.1.2.B The correspondence between vector fields and flows The leading 
idea about vector fields is that, for any manifold, the theorems on the local existence, 
uniqueness and differentiability of solutions of systems of ordinary differential equa- 
tions (e.g. Arnold (1973: 48-49, 77-78, 249-250), Olver (2000: Prop 1.29)) secure a 
one-one correspondence between four notions: 

(i) : Vector fields X on a subset U C M, on which they are non-zero; X : x eU ^— 
X(x) Gr,M,X(x) ^0; 

(ii) : Non-zero directional derivatives at each point x G f/, in the direction of the 
vector X(x). In terms of coordinates x = x^,...,x", these are first-order linear dif- 
ferential operators X^(x)g|j- + . . . + X"(x)^, with X*(x) the i-component in this 
coordinate system of the vector X(x). Such an operator is often introduced abstractly 
as a derivation: a map on the set of smooth real- valued functions defined on a neigh- 
bourhood of X, that is linear and obeys the Leibniz rule. 

(iii) : Integral curves (aka: solution curves) of the fields X in U\ i.e. smooth maps 
(j) : I ^ M from a real open interval / C IR to f/, with G /, 0(0) = x G f/, and whose 
tangent vector at each 0(r), r G / is X(0(r)). 
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(iv): Flows X"^ mapping, for each field X and each x & U, some appropriate subset 
of U to another: X'^ : U ^ M. This flow is guaranteed to exist only in some neigh- 
bourhood of a given point x, and for r in some neighbourhood of G IR; but this will 
be enough for us. Such a flow is a one-parameter subgroup of the "infinite-dimensional 
group" of all local diffeomorphisms. 

1 spell out this correspondence in a bit more detail: — In local coordinates x^, x", 
any smooth curve : / — >■ M is given by n smooth functions <P{t) = (0"^(r), 0"'(r)), 
and the tangent vector to at 0(t) E M is 

'^(-)='^V)^ + ... + 0"(r)^. (3.5) 

So for (p to be an integral curve of X requires that for alH = 1, n and all r G / 

0^(r) = X^(r). (3.6) 

The local existence and uniqueness, for a given vector field X and x G M, of the 
integral curve through x (with 0(0) = x) then ensures that the flow, written 
either as X'^ or as (f)x{T) 

X^ -.xeM^ X^(x) = (t>x,x{^) e M , (3.7) 

is (at least locally) well-deflned. The flow is a one-parameter group of transformations 
of M, and X is called its infinitesimal generator. 
The exponential notation 

exp(rX)(a;) := X^{x) = 0x,.(r) (3.8) 

is suggestive. For example, the group operation in the flow, i.e. 

X^+''{x)^X^X''{x)) , 

is written in the suggestive notation 

exp((r + (t)X){x) = exp(rX)(exp(aX)(x)) . (3.10) 

So computing the flow for a given X (i.e. solving a system of n flrst-order differential 
equations!) is called exponentiation of the vector held X. 

Remark: — The above correspondence can be related to our discussion of diffeo- 
morphisms and pushing forward vector fields. In particular: if two vector fields, X on 
M and Y on iV, are f-related by / : M ^ iV, so that {Tf){X{x)) = Y{f{x)), then / 
induces a map from integral curves of X to integral curves of Y. We can express this 
in terms of exponentiation of X and Y — (Tf) (X) : 

/(exp(rX)a;) = exp(r(T/)(X))(/(a;)). (3.11) 

Remark: — I emphasise that the above correspondence between (i), (ii), (iii) and 
(iv) is not true at a single point. More precisely: 
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(a) : On the one hand: the correspondence between (i) and (ii) holds at a point; 
and also holds for zero vectors. That is: a single vector X G T^M corresponds to a 
directional derivative operator (derivation) at x] and X = corresponds to the zero 
derivative operator mapping all local scalars to 0. (Indeed, as I mentioned: vectors are 
often defined as such operators/derivations). But: 

(b) : On the other hand: the correspondence between (i) and (iii), or between (i) 
and (iv), requires a neighbourhood. For a single vector X G T^M corresponds to a 
whole class of curves (and so: of flows) through x, not to a single curve. Namely, it 
corresponds to all the curves (flows) with X as their tangent vector. 

However, we shall see (starting in Section 13. 4|) that for a manifold with suitable 
extra structure, a single vector does determine a curve. (And we will again talk of 
exponentiation. ) 

We need to generalize one aspect of the above correspondence (i)-(iv), namely the 
(i)-(ii) correspondence between vectors and directional derivatives. This generalization 
is the Lie derivative. 



3.1.3 The Lie derivative 

Some previous Sections have briefly used the Lie derivative. Since we will use it a lot 
in the sequel, we now introduce it more thoroughly. 

We have seen that given a vector field X on a manifold M, a point x G M, and any 
scalar function / defined on a neighbourhood of x, there is a naturally defined rate of 
change of / along X at x: the directional derivative X{x){f). 

Now we will define the Lie derivative along X as an operator Cx that defines a rate 
of change along X: not only for locally defined functions (for which the definition will 
agree with our previous notion, i.e. we will have Cx{f) = -^(/)); but also for vector 
fields and differential 1-forms.^^ We proceed in three stages. 

(1): We first define the Lie derivative as an operator on scalar functions, in terms 
of the vector field X on M. We define the Lie derivative along the field X (aka: the 
derivative in the direction of X), Cx, as the operator on scalar functions / : M — IR 
defined by: 

/:x:/^/:x/:M^lR with VxGM: {Cxf){x) := \r=o f{X^{x)) = X{x){f). 

dr 

(3.12) 

Though this definition assumes that both X and / are defined globally, i.e. on all of 
M , it can of course be restricted to a neighbourhood. Thus defined, Cx is linear and 
obeys the Leibniz rule, i.e. 

Cx{fg) = fCx{g)+gCx{f) ; (3.13) 

^^Indeed, the definition can be extended to all higher rank tensors. But I will not develop those 
details, since — apart from Section [2. 1.31 s mention of the Lie derivative of the symplectic form Cx^jJ 
(viz. the requirement that if X is a symmetry, Cx^ — 0) — we shall not need them. 



In coordinates x = x^, x^, Cxf is given by 



^-^ = ^'^")|t + ---+^"W^' (3-14) 

with X*(x) the i-component of the vector X{x). Eq. 13. 141 means that despite eq. I3.12[ s 
mention of the flow X'^, the Lie derivative of a scalar agrees with our previous notion 
of directional derivative: that is, for all /, Cx{f) = X{f). 

(2) : In (1), the vector field X determined the operator Cx'- in terms of Section 
l3.1.2[ B's correspondence, we moved from (i) to (ii). But we can conversely define a 
vector field in terms of its Lie derivative; and in Section I3.2.2f s discussion of the Lie 
bracket, we shall do exactly this. 

In a bit more detail: — We note that the set J^{M) of all scalar fields on M, / : 
M — > IR forms an (infinite-dimensional) real vector space under pointwise addition. 
So also does the set X{M) of all vector fields on M, X : x G M t-^ X{x) G T^M. 
Furthermore, X{M) is isomorphic as a real vector space, and as an module over the 
scalar fields, to the collection of operators Cx- The isomorphism is given by the map 
e-.X^Cx defined in (1). 

(3) : We now extend the definition of Cx so as to define it on vector fields Y 
and 1-forms a. We can temporarily use 9 as notation for either a vector field Y or a, 
differential 1-form a. Given a vector field X and flow = (pxi'T)-, we need to compare 
9 at the point x G M with 9 at the nearby point X'^{x) = 4'x,x{'^), in the limit as r 
tends to zero. But the value of 9 at X'^{x) is in the tangent space, or cotangent space, 
at X'^(x): Txr(x)M or TJ^^-^^M. So to make the comparison, we need to somehow 
transport back this value to T^M or T*M. 

Fortunately, the vector field X provides a natural way to define such a transport. 
For the vector field Y, we use the differential (i.e. push-forward) of the inverse flow, 
to "get back" from X^(x) to x. Using 4'*{t) for this "pullback" of (f)x,x{T), we define 

0*(r) := T(exp(-rX)) = dexp(-rX) : Tx.(.)M = Te.p(.x)wM ^ T,M . (3.15) 

For the 1-form a, we define the transport by the pullback, already defined by eq. 13.41 

0*(r) := (exp(-rX))* : T1.(,)M ^ T:^^^,x)i.)M - T;M . (3.16) 

With these definitions of 0*(t), we now define the Lie derivative Cx9, where 9 is 
a vector field Y or a differential 1-form a, as the vector field or differential 1-form 
respectively, with value at x given by 

lim = -T- r=o [r)[9 \xr(x)) ■ 3.17 

T^o r dr 

Finally, an incidental result to illustrate this Chapter's "story so far". It connects 
Noether's theorem, from Section B.1.3l to this Section's details about the Lie derivative, 
and to the theorem stating the local existence and uniqueness of solutions of ordinary 



differential equations (cf. the start of Section l!3.1.2I B). This latter theorem implies that 
on any manifold any vector field X can be "straightened out" , in the sense that around 
any point at which X is non-zero, there is a local coordinate system in which X has 
all but one component vanish and the last component equal to 1. Using this theorem, 
it is straightforward to show that on any even-dimensional manifold any vector field 
X is locally Hamiltonian, with respect to some symplectic form, around a point where 
X is non-zero. One just defines the symplectic form by Lie-dragging from a surface 
transverse to X's integral curves. 

3.2 Lie algebras and brackets 

I now introduce Lie algebras and the Lie bracket of two vector fields. 
3.2.1 Lie algebras 

A Lie algebra is a vector space V equipped with a bilinear anti-symmetric operation, 
usually denoted by square brackets (and called 'bracket' or 'commutator'), [, ] : VxV ^ 
V, that satisfies the Jacobi identity, i.e. 

[[X,Y],Z] + [[Y,Z],X] + [[Z,X],Y] = . (3.18) 

3. 2.1. A Examples; rotations introduced Here are three examples. 

(i) : n X n matrices equipped with the usual commutator, i.e. [X, Y] := XY — YX. 
(So the matrix multiplication "contributes" to the bracket, but not to the underlying 
vector space structure.) 

(ii) : 3x3 anti-symmetric matrices, equipped with the usual commutator. 

(iii) : IR^ equipped with vector multiplication. In fact, example (iii) is essentially 
the same as example (ii); and this example will recur in what follows, in connection 
with rotations and the rigid body. (We will also see that example (ii) is in a sense more 
fundamental.) 

To explain this, we first recall that every anti-symmetric operator A on a three- 
dimensional oriented euclidean space is the operator of vector multiplication by a fixed 
vector, uj say. That is: for all q, = [cu, q] = u A q. (Proof: the anti-symmetric 
operators on for a 3-dimensional vector space, since an anti-symmetric 3x3 matrix 
has three independent components. Vector multiplication by a vector ci; is a linear and 
anti-symmetric operator; varying u we get a subspace of the space of all anti-symmetric 
operators on IR^; but this subspace has dimension 3; so it coincides with the space of 
all anti-symmetric operators.) 

With this result in hand, the following three points are all readily verified. 
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(1): The matrix representation of A in cartesian coordinates is then 



/ -Us UJ2 \ 

A = I uJs -uji \ . (3.19) 

\ -LaJ2 COi / 

We can write ^ 

A^ uj or Aij = -eijkUJk or Ui = --eijkAjk. (3.20) 

(2) : The plane 11 of vectors perpendicular to u is an invariant subspace for A, i.e. 
A{Il) = n. And u is an eigenvector for A with eigenvalue 0. This suggest a famihar 
elementary interpretation, which will be confirmed later f Section 13. 4|) : viz. that any 
3x3 anti-symmetric matrix A represents a infinitesimal rotation, and u represents 
instantaneous angular velocity. That is, we will have, for all q G H^: q = Aq = [cu, q]. 

(3) : The commutator of any two 3x3 anti-symmetric matrices A, B, i.e. [A, B] : = 
AB — BA, corresponds by eq. 13.201 to vector multiplication of the axes of rotation. 



That is: writing eq. I3.20[ s bijection from vectors to matrices as B : i— > A =: Q{uj), 
we have for vectors q, r, s 

(e(q)e(r) - 0(r)0(q))s = e(q)[r,s] - 0(r)[q,s] (3.21) 

= [q,[r,s]]-[r,[q,s]] (3.22) 

= [[q,r],s] = 0([q,r])-s. (3.23) 

where the [,] represents vector multiplication, i.e. [q, r] = q A r. 



Eq. 13.231 means that gives a Lie algebra isomorphism; and so our example (iii) 
is essentially the same as example (ii). 

Besides, we can already glimpse why example (ii) is in a sense more fundamental. 
For this correspondence between anti-symmetric operators (or matrices) and vectors, 
eq. 13.201 is specific to three dimensions. In n dimensions, the number of independent 
components of an anti-symmetric matrix is n{n — l)/2: only for n = 3 is this equal 
to n. Yet we will see later (Section I3.4.4|) that rotations on euclidean space IR" of 
any dimension n are generated, in a precise sense, by the Lie algebra of n x n anti- 
symmetric matrices. So only for n = 3 is there a corresponding representation of 
rotations by vectors in IR". 

In the next two Subsections, we shall see other examples of Lie algebras: whose 
vectors are vector fields (Section I3.2.2|) . or tangent vectors at the identity element 
of a Lie group f Section 13. 4j) . The first example will be an infinite-dimensional Lie 
algebra; the second finite-dimensional (since we will only consider finite-dimensional 
Lie groups). Besides, the above examples (i) and (ii) (equivalently: (i) and (iii)) will 
recur: each will be the vector space of tangent vectors at the identity element of a Lie 
group. 

3.2.1.B Structure constants A finite-dimensional Lie algebra is characterized, rel- 
ative to a basis, by a set of numbers, called structure constants that specify the bracket 



operation. Thus if {vi,...,Vn} is a basis of a Lie algebra V, we define the structure 
constants c^j, k = 1, n) by expanding, in terms of this basis, the bracket of any 
two basis elements 

[v^,v,]=J:k4vk . (3.24) 

The bilinearity of the bracket implies that eq. 13.241 determines the bracket of all pairs 
of vectors v,w &V. And the bracket's obeying anti-symmetry and the Jacobi identity 
implies that, for any basis, the structure constants obey 

'^ij = j ^fc(cfjC^ + cfjC^- + c^jC^) = (3.25) 

Conversely, any set of constants cf^ obeying eq. 13.251 are the structure constants of an 
n-dimensional Lie algebra. 



3.2.2 The Lie bracket of two vector fields 



Given two vector fields X, y on a manifold M, the corresponding flows do not in general 
commute: X*y* ^ V^X^. The non-commutativity is measured by the commutator of 
the Lie derivatives of X and of Y, i.e. Cx^Cy — CyHx- (Cf. eq. 13.121 and 13.171 for a 
definition of the Lie derivative.) Here, 'measured' can be made precise by considering 
Taylor expansions; but I shall not go into detail about this. 

What matters for us is that this commutator, which is at first glance seems to be a 
second-order operator, is in fact a first-order operator. This is verified by calculating in 
a coordinate system, and seeing that the second derivatives occur twice with opposite 
signs: 

- = E, A - E, Y>§- (E.A-f ) (3.26) 

/ -dY^ ■dX^\ df 
^...^^,„{x-^-Y'—)^. (3.27) 

So C-xl^Y — C.yI^x corresponds to a vector field: (recall (2) of Section 13.1.31 about 
defining a vector field from its Lie derivative). We call this field Z the Lie bracket (also 
known as: Poisson bracket, commutator, and Jacobi-Lie bracket!) of the fields X and 
y, and write it as [X, Y\. It is also written as LxY and called the Lie derivative ofY 
with respect to X. (Beware: some books use an opposite sign convention.) 

Thus Z = [X, Y] = CxY is defined to be the vector field such that 

C-z = ^[x,Y] = CxCy — Cy^x ■ (3.28) 

It follows that Z = [X, y]'s components in a coordinate system are given by eq. 
13.271 This formula can be remembered by writing it (with summation convention, i.e. 
omitting the S) as 



X^S-.Y^-^ 



X-^Ji^-Y^fl^ (3.29) 

dx' dx^ dx^ dx' 



Another way to write eq. 13.271 is as: 



[X, Y]> = (X ■ V)Y^ - {Y ■ V)X^ ; (3.30) 

or without coordinates, writing D for the derivative map given by the Jacobian matrix, 
as 

[X, r] = DF ■ X - DX ■ F. (3.31) 

Again, the vector field Z = [X, Y] measures the non-commutation of the fiows X* and 
Y^: in particular, these fiows commute iff [X, Y] = 0. 

We will need three results about the Lie bracket. They concern, respectively, the 
relation to Lie algebras, to Poisson brackets, and to Frobenius' theorem. 

(1) : The Lie bracket is obviously a bilinear and anti-symmetric operation on the 
(infinite-dimensional) vector space X{M) of all vector fields on M: [, ] : X{M) x 
X{M) —>■ X{M). One readily checks that it satisfied the Jacobi identity. (Expand 
C[[x,Y],z] = C.{XY\l^z~C.zC.{xy\ etc.) So: X{M) is an (infinite-dimensional) Lie algebra. 

(2) : Returning to Hamiltonian mechanics (Section 12.11) : there is a simple and 
fundamental relation between the Lie bracket and the Poisson bracket, via the notion 
of Hamiltonian vector fields f Section I2.1.3|) . 

Namely: the Hamiltonian vector field of the Poisson bracket of two scalar functions 
/, g on the symplectic manifold M is, upto a sign, the Lie bracket of the Hamiltonian 
vector fields, X/ and Xg, of / and g: 

= -\^S^^9\ = [Xg^Xf]- (3-32) 

Proof: apply the rhs to an arbitrary scalar h : M —>■ ]R. One easily obtains X{f^gj{h), 
by using: 

(i) the definition of a Hamiltonian vector field; 

(ii) the Lie derivative of a function equals its elementary directional derivative eq. 
IITT^ and 

(iii) the Poisson bracket is antisjnnmetric and obeys the Jacobi identity. 

This result means that the Hamiltonian vector fields on a symplectic manifold M, 
equipped with the Poisson bracket, form an (infinite-dimensional) Lie subalgebra of 
the Lie algebra X{M) of all vector fields on the symplectic manifold M. Later, it will 
be important that this result extends from symplectic manifolds to Poisson manifolds; 
(details in Section 15. 2. 2j) . 

(3) : For Frobenius' theorem f Section 13. 3p . we need to relate the Lie bracket to 
Section 13. 1.21 s idea of vector fields being /-related by a map f : M ^ N between 
manifolds M and N. In short: if two pairs of vector fields are /-related, so is their Lie 
bracket. More explicitly: if X, Y are vector fields on M, and / : M — > X is a map such 
that (T/)(X), (T/)(y) are well-defined vector fields on N, then Tf commutes with 
the Lie bracket: 

(T/)[X,r] = [(T/)X, (T/)r] . (3.33) 
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3.3 Submanifolds and Frobenius' theorem 

This Subsection differs from tlie preceding ones in three ways. First, it emphasises 
integral, rather than differential, notions. 

Second: Section K11.2I B have emphasised that the integral curves of a vector field 
correspond to integrating a system of ordinary differential equations. Since such curves 
are one- dimensional submanifolds of the given manifold, our present topic, viz. higher- 
dimensional submanifolds, naturally suggests partial differential equations. For their 
integration involves finding, given an assignment to each point x of a manifold M of a 
subspace S^^ (with dimension greater than one) of the tangent space T^M, an integral 
surface, i.e. a submanifold S" of M whose tangent space at each of its points is S^.^^ 

However, we will not be concerned with partial differential equations. For us, 
submanifolds of dimension higher than one arise when the span Sx of the tangent 
vectors at x to a set of vector fields fit together to form a submanifold. Thus Frobenius' 
theorem states, roughly speaking, that a finite set of vector fields is integrable in this 
sense iff the vector fields are in involution. That is: iff their pairwise Lie brackets are 
expandable in terms of the fields; i.e. the vector fields form a Lie subalgebra of the 
entire Lie algebra of vector fields. We will not need to prove this theorem. But we 
need to state it and use it — in particular, for the foliation of Poisson manifolds. 

Third: a warning is in order. The intuitive idea of a subset S G M that is a 
smooth manifold "in its own right" can be made precise in different ways. So there 
are subtleties about the definition of 'submanifold', and terminology varies between 
expositions — in a way it does not for the material in previous Sections. I will adopt 
what seems to be a widespread, if not majority, terminology. 



3.3.1 Submanifolds 

The fundamental definition is: 

Given a manifold M (dim(M)=n), a submanifold of M of dimension is a subset 
N (Z M such that for every y ^ N there is an admissible local chart (i.e. a chart in 

^^Beware: there is no analogue for partial differential equations of the local existence and uniqueness 
theorem for ordinary differential equations. Even a field of two-dimensional planes in three-dimensional 
space is in general not integrable, e.g. the field of planes given by the equation dz — ydx. So integrable 
fields of planes, or other tangent subspaces on a manifold, are an exception; and accordingly, the 
integration theory for partial differential equations is less unified, and more complicated, than that 
for ordinary differential equations. 

^''My treatment is based on Marsden and Ratiu (1999, p. 124-127, 140) for Section lOTI and 
Olver (2000, p. 38-40) for Section IXT^ As to varying terminology: Olver (2000, p. 9) defines 
'submanifold' to be what we will call an immersed submanifold; (which latter, for us, does not have 
to be a submanifold, since the immersion need not be an embedding). Bishop and Goldberg (1980, p. 
40-41) provide a similar example. For a detailed introduction to the different notions of submanifold, 
cf. Darling (1994, Chapters 3 and 5). Note that I will also omit some details, in particular about 
Frobenius' theorem providing regular immersions. 
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M's maximal atlas) {U, (p) with y (zU and with the suhmanifold property, viz. 

(SM). (t):U X WC-^ and 0(f/ N) = (f){U) f] (IR'^ x {0}). (3.34) 

The set becomes a manifold, generated by the atlas of all charts of the form {UdN, cj) \ 
{U n A^)), where {U, (p) is a chart of M having the submanifold property. (This makes 
the topology of the relative topology.) 

We need to take note of two ways in which submanifolds can be specified in terms 
of smooth functions between manifolds. 

(1) : A submanifold can be specified as the set on which a smooth function f : M ^ 
P between manifolds takes a certain value. In effect, this will be a generalization of 
eq. I3.34f s requirement that n — k coordinate-components of a chart (p take the value 
zero. This will involve the idea that the tangent map Tf is surjective, in which case / 
will be called a submersion. We will need this approach for quotients of actions of Lie 
groups. 

(2) : A submanifold can be specified parametrically, as the set of values of a local 
parametrization: i.e. as the range of a smooth function / with M as codomain. This 
will involve the idea that the tangent map T f is injective, in which case / will be called 
an immersion. We will need this approach for Frobenius' theorem. 

(1): Submersions: — 

If / : M ^ P is a smooth map between manifolds, a point x G M is called a regular 
point if the tangent map T^f is surjective; otherwise x is a critical point of /. If C C M 
is the set of critical points of M, we say f{C) is the set of critical values of /, and 
P — f{C) is the set of regular values of /. So if p G P is a regular value of /, then at 
every x G M with f{x) = p, T^f is surjective. 

The submersion theorem states that if p G P is a regular value of /, then: 

(1) : f~^{p) is a submanifold of M of dimension dim(M) - dim(P); and 

(ii): the tangent space of this submanifold at any point x G f~^{p) is the kernel of 
/'s tangent map: 

T,ir\p))=keTTJ . (3.35) 
If T^f is surjective for every x G M, / is called a submersion. 

(2) : Immersions: — 

A smooth map between manifolds / : M — P is called an immersion if T^f is injective 
at every x G M. The immersion theorem states that T^f is injective iff there is a 
neighbourhood U of x in M such that f{U) is a submanifold of P and f \u: U —>■ f{U) 
is a diffeomorphism. 

NB: This does not say that f{M) is a submanifold of P. For / may not be injective 
(so that /(M) has self-intersections). And even if / is injective, / can fail to be a 
homeomorphism between M and f{M), equipped with the relative topology induced 
from P. A standard simple example is an injection of an open interval of IR into an 
"almost-closed" figure-of-eight in IR^. 

Nevertheless, when / : M ^ P is an immersion, and is also injective, we call /(M) 
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an injectively immersed submanifold (or shorter: an immersed submanifold): though 
f{M) might not be a submanifold. 

We also define an embedding to be an immersion that is also a homeomorphism (and 
so injective) between M and f{M) (where the latter has the relative topology induced 
from P). If / is an embedding, f{M) is a submanifold of and / is a diffeomorphism 
/:M^/(M). 

In fact, Frobenius' theorem will provide injectively immersed submanifolds that 
need not be embedded, and so need not be submanifolds. (They must also obey 
another condition, called 'regularity', that I will not go into.) 

3.3.2 The theorem 

We saw at the end of Section 13.2.21 that if two pairs of vector fields are /-related, so 
is their Lie bracket: cf. eq. 13.331 This result immediately yields a necessary condition 
for two vector fields to be tangent to an embedded submanifold: namely 

If Xi,X2 are vector fields on M that are tangent to an embedded submanifold 5* 
(i.e. at each x E S, Xj(x) G T^S < T^M), then their Lie bracket [Xi,X2] is also 
tangent to 5*. 

This follows by considering the diffeomorphism f : S —>■ S that gives an embedding 
of S in M. One then uses the fact that Tf commutes with the Lie bracket, eq. 13.331 
That is: the Lie bracket of the /-related vector fields Xi,X2 on S, which is of course 
tangent to S, is carried by Tf to the Lie bracket [Xi, X2] of Xi and X2. So [Xi, X2] is 
tangent to S. 

The idea of Frobenius' theorem will be that this necessary condition of two vector 
fields being tangent to a submanifold is also sufficient. To be more precise, we need 
the following definitions. 

A distribution D on a manifold M is a subset of the tangent bundle TM such that 
at each x G M, := D (1 T^M is a vector space. The dimension of is the rank of 
D at X. If the rank of D is constant on M, we say the distribution is regular. 

A distribution is smooth if for every x G M, and every Xq e D^, there is a neigh- 
bourhood f/ C M of x, and a smooth vector field X on f/ such that (i) X(x) = Xq, 
(ii) for all y E U, X{y) E Dy. Such a vector field X is called a local section of D. Ex- 
ample: a set of r vector fields, Xi, ...,Xr each defined on M, together define a smooth 
distribution of rank at most r. 

A distribution is involutive if for any pair Xi,X2 of local sections, the Lie bracket 
[Xi,X2](?/) G Dy in the two sections' common domain of definition. 

We similarly say that a set of r smooth vector fields, Xi, ...,X^, on a manifold M 
is in involution if everywhere in M they span their Lie brackets. That is: there are 
smooth real functions h'lj : M ^ R,i, j, k = 1, r such that at each x G M 

[X„X,](a;) = Sfc h1^{x)Xk{x). (3.36) 
(Beware: involution is used in a different sense in connection with Liouville's theorem, 
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viz. a set of real functions on phase space is said to be in involution when all their 
pairwise Poisson brackets vanish.) 

A distribution D on M is integrable if for each x G M there is a local subman- 
ifold N{x) of M whose tangent bundle equals the restriction of D to N{x). If D is 
integrable, the various N{x) can be extended to get, through each x G M, a unique 
maximal connected set whose tangent space at each of its elements y is Dy. Such a set 
is called a (maximal) integral manifold. 

NB: In general, each integral manifold is injectively immersed in M, but not embed- 
ded in it; and so, by the discussion in (2) of Section Td. 3. 11 an integral manifold might 
not be a submanifold of M. But (like most treatments), I shall ignore this point, and 
talk of them as submanifolds, integral submanifolds. 

If the rank of D is constant on M, all the integral submanifolds have a common 
dimension: the rank of D. But in general the rank of D varies across M, and so does 
the dimension of the integral submanifolds. 

We similarly say that a set of r vector fields, Xi, X^,, is integrable; viz. if through 
every x G M there passes a local submanifold N{x) of M whose tangent space at each 
of its points is spanned by Xi, X^. (Again: we allow that at some x, Xi{x), Xr{x) 
may be linearly dependent, so that the dimension of the submanifolds varies.) 

We say (both for distributions and sets of vector fields) that the collection of integral 
manifolds is a foliation of M, and its elements are leaves. Again: if the dimension of 
the leaves is constant on M, we say the foliation is regular. 

With these definitions in hand, we can now state Frobenius' theorem: both in its 
usual form, which concerns the case of constant rank, i.e. regular distributions and 
vector fields that are everywhere linearly independent; and in a generalized form. The 
usual form is: 

Frobenius' theorem (usual form) A smooth regular distribution is in- 
tegrable iff it is involutive. 

Or in terms of vector fields: a set of r smooth vector fields, Xi, ...,Xj., on 
a manifold M, that are everywhere linearly independent, is integrable iff it 
is in involution. 

The generalization comes in two stages. The first stage concerns varying rank, but 
assumes a finite set of vector fields. It is straightforward: this very same statement 
holds. That is: a set of r smooth vector fields, Xi, ...,Xr, on a manifold M (perhaps 
not everywhere linearly independent) is integrable iff it is in involution. 

But for the foliation of Poisson manifolds f Section I5.3.3j) . we need to consider an 
infinite set of vector fields, perhaps with varying rank; and for such a set, this statement 
fails. Fortunately, there is a useful generalization; as follows. 

Let be a set of vector fields on a manifold M, that forms a vector space. So in 
the above discussion of r vector fields, X can be taken as all the linear combinations 
^[=1 fi{x)Xi{x),x G M, where the fi are arbitrary smooth functions / : M — IR. 
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Such an X is called finitely generated. 

For any X forming a vector space, we say (as before) that X is in involution if 
[X, Y] E X whenever X,Y G X. Let X^ be the subspace of T^M spanned by the X{x) 
for all X E X. As before, we define: an integral manifold of X is a submanifold N G M 
such that for all y E N, TyN = Xy] and X is called integrable iff through each x G M 
there passes an integral manifold. 

As before: if X is integrable, it is in involution. But the converse fails. A further 
condition is needed, as follows. 

We say that X is rank-invariant if for any vector field X E X, the dimension of 
the subspace A'exp(rX)(x) along the flow generated by X is a constant, independent of 
r. (But it can depend on the point x.) 

Since the integral curve exp(rX)(a;) through x should be contained in any integral 
submanifold, rank-invariance is certainly a necessary condition of integrability. (It also 
follows from X being finitely generated.) In fact we have: 

Probenius' theorem (generalized form) A system X of vector fields on 
M is integrable iff it is rank-invariant and in involution. 

The idea of the proof is to directly construct the integral submanifolds. The submani- 
fold through X is obtained as 

N = {exp(Xi)exp(X2)....exp(Xp)(a;) : p > G X}. (3.37) 

The rank-invariance secures that for any y E N, Xy has dimension dim(A^). 

3.4 Lie groups, and their Lie algebras 

I introduce Lie groups and their Lie algebras. By the last two Subsections (Sections 
13.4.31 and I3.4.4j) . we will have enough theory to compute efficiently the Lie algebra of 
a fundamentally important Lie group, the rotation group. 

3.4.1 Lie groups and matrix Lie groups 

A Lie group is a group G which is also a manifold, and for which the product and 
inverse operations G x G ^ G and G ^ G are smooth. 

Examples: — - 

(i) : IR"^ under addition. 

(ii) : The group of linear isomorphisms of IR" to IR", denoted GL{n, M) and called 
the general linear group; represented by the real invertible n x n matrices. This is 
an open subset of IR" , and so a manifold of dimension ra^; and the formulas for the 
product and inverse of matrices are smooth in the matrix components. 

(iii) The group of rotations about the origin of IR'^, represented by 3 x 3 orthogonal 



matrices of determinant 1; denoted 50(3), where S stands for 'special' (i.e. determi- 
nant 1), and O for 'orthogonal'. 

In fact, all three examples can be regarded as Lie groups of matrices, with matrix 
multiplication as the operation. In example (i), consider the isomorphism 6 between 
IR"" under addition and (n + 1) x [n + 1) matrices with diagonal entries all equal to 1, 
other rightmost column entries equal to the given vector in IR", and all other entries 
zero. Thus consider, for the case n = 3: 
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This suggests that we define a matrix Lie group to be any set of invertible real matrices, 
under matrix multiplication, that is closed under multiplication, inversion and taking 
of limits. That a matrix Lie group is a Lie group will then follow from GL{n, H) being 
a Lie group, and the theorem below (in Section 13. 4. 3^ that any closed subgroup of a 
Lie group is itself a Lie group. 

For matrix Lie groups, some of the theory below simplifies. For example, the 
definition of exponentiation of an element of the group's Lie algebra reduces to ex- 
ponentiation of a matrix. But we will develop some of the general theory, since (as 
always!) it is enlightening and powerful. 



3.4.2 The Lie algebra of a Lie group 

The main result in this Subsection is that for any Lie group G, the tangent space TgG 
at the identity e E G has a natural Lie algebra structure that is induced by certain 
natural vector fields on G; as follows. 

3. 4. 2. A Left-invariant vector fields define the Lie algebra 

Let G be a Lie group. Each g E G defines a diffeomorphism of G onto itself by left 
translation, and similarly by right translation: 

Lg : h e G ^ gh e G ; Rg : h E G ^ hg E G. (3.39) 

Remark: In Section ^ we will describe this in the language of group actions, saying 
that in eq. 13.391 G acts on itself hj left and right translation. 

Now consider the induced maps on the tangent spaces, i.e. the tangent (aka: deriva- 
tive) maps; cf. eq.s l3.1ll3.2[ They are (i^g)* =: Lg^, (Rg)* =: Rg* where for each h E G: 

Lg, : ThG ^ TghG and Rg, : ThG ^ ThgG. (3.40) 

In particular: the derivative {Rg)* at e G G maps T^G to TgG. This implies that 
every vector C, E T^G defines a vector field on G: its value at any g E G is the image 
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{Rg),^C, of ^ under (Rg)*. Such a vector field is called a right-invariant vector field: it 
is uniquely defined by (applying the derivative of right translation to) its value at the 
identity e E G. 

In more detail, and now defining left-invariant vector fields: — 
A vector field X on G is called left-invariant if for every g E G, {Lg)^X = X. More 
explicitly, let us write ThLg for the tangent or derivative of Lg at h, i.e. for Lg^ : T^G — > 
TghG. Then left-invariance requires that 

{ThLg)X{h) = X{gh) for every ^ and /i G G. (3.41) 

Thus every vector ^ G T^G defines a left-invariant vector field, written X^, on G: X^'s 
value at any g E G is the image {Lg)^,^ of ^ under {Lg)^,. In other words: Xi^{g) := 

{TeLg)i. 

Not only is a left-invariant vector field uniquely defined by its value at the identity 
e (z G. Also, the set Xl{G) of left-invariant vector fields on G is isomorphic as a vector 
space to the tangent space T^G at the identity e. For the linear maps a,/? defined by 

a : X G Xl{G) ^ X(e) G T^G ; and /5 : ^ G T^G ^ {(? ^ X5((7) := (T,L,)0 e A'z.(G) 

(3.42) 

compose to give the identity maps: 

(3 o a = idx^[G) ; ao (3 = idx^G- (3.43) 

Xl{G) is a Lie subalgebra of the Lie algebra of all vector fields on G, because it is 
closed under the Lie bracket. That is: the Lie bracket of left-invariant vector fields X 
and Y is itself left-invariant, since one can check that for every g E G we have (with L 
meaning 'left' not 'Lie'!) 

Lg, [X, Y] = [Lg,X, Lg,Y] = [X, Y] . (3.44) 

If we now define a bracket on TgG by 

[e,r^]:=[X5,X,](e) (3.45) 

then TeG becomes a Lie algebra. It is called the Lie algebra of G, written g (or, to 
avoid ambiguity about which Lie group is in question: 0(G)). It follows from eq. 13.441 
that 

[X^,X,]=X[^,,] ; (3.46) 
that is to say, the maps are Lie algebra isomorphisms. 

This result, that T^G has a natural Lie algebra structure, is very important. For, 
as we shall see in the rest of Section 13.41 the structure of a Lie group is very largely 
determined by the structure of this Lie algebra. Accordingly, as we shall see in Sections 
IHandElet seq.: this Lie algebra underpins most of the constructions made with the Lie 
group, e.g. in Lie group actions. Thus Olver writes that this result 'is the cornerstone of 
Lie group theory ... almost the entire range of applications of Lie groups to differential 
equations ultimately rests on this one construction!' (Olver 2000: 42). 

Before turning in the next Subsection to examples, and the topic of subgroups and 
subalgebras, I end with four results, (l)-(4), which will be needed later; and a remark. 
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3.4.2. B Four results : 

(1): Lie group structure determines Lie algebra structure in the following sense. If 
G, H are Lie groups, and f : G ^ H is a. smooth homomorphism, then the derivative 
of / at the identity Tef : q{G) — > q{H) is a Lie algebra homomorphism. In particular, 
for all e,r7 G 0(G), (Te/)[e,r/] = [Te/(0, TJ(r/)]. (Cf. eq. EM) 

(2): Exponentiation again; a correspondence between left-invariant vector fields and 
one- dimensional subgroups: 

Recall from Section 13.11 especially eq. 13.81 that each vector field X on the manifold G 
determines an integral curve (px in G passing through the identity e (with 0x(O) = e). 
We now write the points in (the image of) this curve as Qr {X and e being understood): 

exp(rX)(e) = X^(e) = 0x,e(r) =: gr- (3.47) 

It is straightforward to show that if X is left-invariant, this (image of a) curve is a one- 
parameter subgroup of G: i.e. not just as eq. 13.71 et seq., a one-parameter subgroup of 
the group of diffeomorphisms of the manifold G. In fact: 

9r+a = QrQa 9o = c g^^ = g^r ■ (3.48) 

Besides, the group is defined for all r G IR; and is isomorphic to either IR or the circle 
group 5*0(2). Conversely, any connected one-parameter subgroup of G is generated by 
a left-invariant vector field in this way. 

Accordingly, we define exponentiation of elements ^ of g by reference to the isomor- 
phisms eq. 13.421 and 13.431 It is also convenient to define this as a map taking values 
in G. Thus for ^ G g and its corresponding left-invariant vector field X^ that takes 
as value a.t g E G, X^{g) := (TeLg){^), we write the integral curve of X^ that passes 
through e (with value e for argument r = 0) as 

05 : r G IR ^ exp{rX^){e) G G . (3.49) 

Then we define the exponential map of g into G to be the map 

exp : e G ^ 0^(1) G G . (3.50) 

Using the hnearity of P as defined by eq. 13.421 these two equations, eq. 13.491 and 13.501 
are related very simply: 

exp(rO := M^) := exp{l.Xr^){e) = expirX^) . (3.51) 

We write exp^ rather than exp when the context could suggest a Lie group other than 
G. 

The map exp is a local diffeomorphism of a neighbourhood of G to a neighbour- 
hood of e G G; but not in general a global diffeomorphism onto G. In modern terms, 
this result follows by applying the inverse function theorem to the discussion above. 
(It also represents an interesting example of the history of subject; cf. Hawkins (2000: 
82-83) for Lie's version of this result, without explicit mention of its local nature.) 
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The map exp also has the basic property, adding to result (1) above, that ... 

(3) : Homomorphisms respect exponentiation: 

If / : G — s> if is a smooth homomorphism of Lie groups, then for all ^ G 0, 

/(expGO = expH((TJ)(0). (3.52) 

(4) : Right-invariant vector fields as an alternative approach: 

We have followed the usual practice of defining g in terms of left-invariant vector fields. 
One can instead use right-invariant vector fields. This produces some changes in signs, 
and in whether certain defined operations respect or reverse the order of two elements 
used in their definition. I will not go into many details about this. But some will be 
needed when we consider: 

(i) : Lie group actions, and especially their infinitesimal generators (Section 14.41 and 

EM; 

(ii) : reduction on the cotangent bundle of a Lie group — as occurs in the theory of 
the rigid body (Section 16.51 and I7.3.3p . 

For the moment we just note two basic results, (A) and (B); postponing others to 
Section lOl et seq.. 

(A) : Corresponding to the vector space isomorphism between q and the left-invariant 
vector fields, as in eq. 13.421 viz. 

^enC^X^e with X^{g) := (T,Lg)i , (3.53) 

there is a vector space isomorphism to the set of right-invariant vector fields 

e e T,G e Xr{G) with Y^{g) := {T,Rg)i . (3.54) 

Besides, the Lie bracket of right-invariant vector fields is itself right-invariant. So 
corresponding to our previous definition, eq. 13.451 of a Lie bracket on TeG, and its 
corollary eq. I3.46[ i.e. [X^,X^] = that makes TeG = Xl{G) a Lie algebra 

isomorphism: we can also define a Lie bracket on TeG by 

[e,r/]ij:=[n,n](e), (3.55) 

and get a Lie algebra isomorphism TeG = Xji{G). 

(B) : But the two Lie brackets, eq. 13.451 and I3.55t on TeG are different. In fact one 
can show that: 

(i): and 1^ are related by 

J^Xg = -Y^ (3.56) 

where / : G — > G is the inversion map I{g) := g^^, and is the push-forward on 
vector fields induced by /, cf. eq. 13.31 i.e. 

ihX^){g):={TIoX^oI-'){g). (3.57) 
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Besides, since J is a diffeomorphism, eq. 13.561 makes /* a vector space isomorphism, 
(ii): It follows from eq. 13.561 that 

[X^,X,]{e) = -[Y^,Y,]{e) ; so [i^rj] = -[i^ri]^ . (3.58) 

Finally, a remark about physics. In applications to physics, G is usually the group of 
symmetries of a physical system, and so a vector field on G is the infinitesimal generator 
of a one-parameter group of symmetries. For mechanics, we saw this repeatedly in 
Section 121 especially as regards the group of translations and rotations about the origin, 
in physical space IR^. This Subsection's isomorphism between the Lie algebra g and 
left-invariant vector fields on G means that we can think of g also as consisting of 
infinitesimal symmetries of the system. (The ^ G g are also called generators of the 
group G.) 

3.4.3 Examples, subgroups and subalgebras 

I begin with the first two of Section 13.4. If s three examples. That will prompt a little 
more theory, which will enable us to deal efficiently in the next Subsection with the 
third example, viz. the rotation group. 

(1): Examples: — 

(i) : G := IR" under addition. G is abelian so that left and right translation coincide. 
The invariant vector fields are just the constant vector fields, so that Xl{G) = Xr{G) = 
IR". So the tangent space at the identity TgG, i.e. the Lie algebra g, is itself IR". The 
bracket structure is wholly degenerate: for all invariant vector fields X, Y , [X, Y] = 0; 
and for all ^, G g, r]] = 0. 

(ii) : G := GL{n,]R), the general linear group. Since G is open in End(]R"',]R"'), 
the vector space of all linear maps on IR" {^End^ for 'endomorphism'), G"s Lie algebra, 
as a vector space, is End(iR^, IR""); (cf. example (i)). To compute what the Lie bracket 
is, we first note that any ^ G i?n(i(lR", IR"") defines a corresponding vector field on 
GL{n, IR) by 

X^:Ae GL{n, U) ^ E Enc/(1R", IR'") . (3.59) 
Besides, X^ is left-invariant, since for every B G GL{n, IR), the left translation 

Lb-.Ae GL{n, lR)^BAe GL{n, IR) (3.60) 

is linear, and so 

X^{LbA) = BAi = TaLbX^{A) . (3.61) 

Applying now eq. I3.31l at the identity / G GL{n, IR) to the definition of the bracket in 
the Lie algebra, eq. I3.45t we have: 

[^,r]] := [X^,X,](/) = DX,(/) ■ X^il) - BX^il) ■ X,(/). (3.62) 

But XrjA = Arj is linear in A, so DX^(/) ■ B = Br]. This means that 

DX,(/) ■ X^{I) = if] ■ (3.63) 



and similarly 

BX^iI)-X,iI)=r]^. (3.64) 

So the Lie algebra End{]R^, IR") has the usual matrix commutator as its bracket: 
[^^v] = ~ V^- This Lie algebra is often written Ql{n, IR). 

Let us apply to this example, result (2) from Section r3.4.2I B. In short, the result 
said that left-invariant vector fields correspond (by exponentiation through e G G) 
to connected one-parameter subgroups of G. To find the one-parameter subgroup 
exp(rXg)(e) of GL{n,'\R), we take the matrix entries Xij,{i,j = l,...,n) as the ri^ 
coordinates on GL{n, IR), so that the tangent space at the identity matrix / is the set 
of vectors 

d 

^ij ^ij ^ 1/ (3-65) 

with ^ = {^ij) an arbitrary matrix. For given ^, exp(rXg)e is found by integrating the 
ordinary differential equations 

= Sfc^ifcXfcj ; Xij{Q) = 5ij. (3.66) 
The solution is just the matrix exponential: 

X(r)=exp(rO. (3.67) 



More generally, let us return to Section 13.4. U s idea of a matrix Lie group. For a 
matrix Lie group G, the definition of its Lie algebra can be given as: 

= { the set of matrices ^ = 0'(O) : a differentiable map : IR — G, 0(0) = e^}. 

(3.68) 

The deduction of the structure of the Lie algebra then proceeds straightforwardly. In 
particular, we get the result that the one-parameter subgroup generated by e is 
given by matrix exponentials, as in eq. 13.671 the group is {exp(r,^) : r G IR}. 

This result will help us compute our third example: finding the Lie algebra of the 
rotation group. But for that example, it is worth first developing a little the result (2) 
from Section [3.4.21 3: i.e. the correspondence between left-invariant vector fields and 
connected one-parameter subgroups of G. 

(2): More theory: — 

First, a warning remark. We will later need to take notice of the fact that a sub- 
group, even a one-parameter subgroup, of a Lie group G need not be a submanifold of 
G. Here we recall Section l!j.3.H s definitions of immersion and embedding. Accordingly, 
we now define a subgroup H of a Lie group G to be a Lie subgroup of G if the inclusion 
map i : H ^ G is an injective immersion. 

Just as we saw in Section r3. 3. ll that not every injective immersion is an embedding, 
so also there are examples of Lie subgroups that are not submanifolds. Example: the 
torus T^ can be made into a Lie group in a natural way (exercise: do this!); the one- 
parameter subgroups on the torus T^ that wind densely on the torus are Lie subgroups 



that are not submanifolds. (For more details about this example, cf. Arnold (1973: 
160-167) or Arnold (1989: 72-74) or Butterfield (2004a: Section 2.1. 3.B).) 

But it turns out that being closed is a sufficient, and necessary, further condition. 
That is: 

If if is a closed subgroup of a Lie group G, then H is a submanifold of G 
and in particular a Lie subgroup. And conversely, if if is a Lie subgroup 
that is also a submanifold, then H is closed. 

Result (2) from Section r3.4.2I B. i.e. the correspondence between one- dimensional 
subgroups of G and one-dimensional subspaces (and so subalgebras) of g, generalizes 
to higher- dimensional subgroups and subalgebras. That is to say: 

If H C G is a Lie subgroup of G, then its Lie algebra t) := q{H) is a 
subalgebra of g = 0(G). In fact 

p, = G : exp(rX^)(e) G if , for all r G IR}. (3.69) 

And conversely, if f) is any m-dimensional subalgebra of g, then there is a 
unique connected m-dimensional Lie subgroup H of G with Lie algebra [). 

The proof of the ffist two statements uses result (1) of Section r3.4.2I B. For the third, 
i.e. converse, statement, the main idea is that [) defines m vector fields on G that 
are linearly independent and in involution, so that one can apply Frobenius' theorem 
to infer an integral submanifold. One then has to prove that if is a Lie subgroup: 
Olver (2000: Theorem 1.51) and Marsden and Ratiu (1999: 279-280) give details 
and references. (Historical note: to see that this result, sometimes called Lie's 'third 
fundamental theorem', is close to what Lie himself called the main theorem of his 
theory of groups, cf. Hawkins (2000: 83).) 

This general correspondence between Lie subgroups and Lie subalgebras prompts 
the question whether every finite-dimensional Lie algebra g is the Lie algebra of a Lie 
group. The answer is Yes. Besides, the question reduces to the case of a matrix Lie 
group (i.e. a Lie subgroup of GL(n, IR)), in the sense that: every finite-dimensional 
Lie algebra q is isomorphic to a subalgebra of 0l(n, IR), for some n. But be warned: 
this does not imply (and it is not true) that every Lie group is realizable as a matrix 
Lie group, i.e. that every Lie group is isomorphic to a Lie subgroup of GL{n, IR). 

This general correspondence also simplifies greatly the computation of the Lie alge- 
bras of Lie groups, for example H := 5*0(3), that are Lie subgroups of GL{n, IR). We 
only need to combine it with example (ii) above, that g[(n, IR) is £^71(^(11", IR") with 
the usual matrix commutator as its bracket: f]] = ^rj — rj^. 

Thus we infer that the Lie algebra of SO (3), written so (3), is a subalgebra of 
f^nc? ( IR", IR") with the matrix commutator as bracket. Besides, we can identify so(3) 
by looking at all the one-dimensional subgroups of G contained in it. Combining eq. 
13.671 and 13.691 we have 

so(3) = G 0[(ra, IR) : the matrix exponential exp(rO G ^0(3), Vr G IR}. (3.70) 



With this result in hand, we can now compute so (3). 



3.4.4 The Lie algebra of the rotation group 

Our first aim is to calculate the Lie algebra 5o(3) (also written: so(3)) of H := 50(3), 
the rotation group. This will lead us back to Section r3.2.1[ A's correspondence between 
anti-symmetric matrices and vectors in IR^. 

S'0(3) is represented by 3 x 3 orthogonal matrices of determinant 1. So the require- 
ment in eq. 13 .71)1 becomes, now writing e, not exp: 

(e"«)(e"«)^ = I and det(e"«) = L (3.71) 

Differentiating the first equation with respect to r and setting r = yields 

^ + ^"^ = 0. (3.72) 

So ^ must be anti-symmetric, i.e. represented by an anti-symmetric matrix. Conversely, 
for any such anti-symmetric matrix ^, we can show that det(e'^^) = 1. So, indeed: 

so (3) = {3x3 antisymmetric matrices}. (3.73) 

Notice that the argument is independent of choosing n = 3. It similarly computes 
so{n) for any integer n: 

so{n) = {n X n antisymmetric matrices}. (3.74) 

Thus the rotations on euclidean space IR" of any dimension n are generated by the Lie 
algebra of n x n anti-symmetric matrices. 

This justifies our assertion at the end of Section [3.2. ll A that the rotation group in 
three dimensions is special in being representable by vectors in the space on which it 
acts, i.e. IR^. For as we have just seen, in general the infinitesimal generators of rota- 
tions are anti-symmetric matrices, which in n dimensions have n{n — l)/2 independent 
components. But only for n = 3 does this equal n. 

Remark: An informal computation of so (3), based on the idea that higher-order 
terms in e'^^ can be neglected (cf. the physical idea that ^ represents an infinitesimal 
rotation), goes as follows. 

For (/ + r^) to be a rotation requires that 

(/ + rO(/ + = / and det(J + ^t) = 1. (3.75) 
Dropping higher-order terms, the first equation yields 

I + T{^ + e) = I i.e. ^ + e = 0. (3.76) 



Besides, the second equation in eq. 13.751 yields no further constraint, since for any 
anti-symmetric matrix ^ written as (cf. eq. I3.19j) 




(3.77) 



we immediately compute that det(/ + ^r) = 1 + t^(^^ + ^| + ^|). So, dropping higher- 
order terms, det(/ + ^r) = 1. In short, we again conclude that 

so (3) = {3x3 antisymmetric matrices}. (3.78) 

For later use (e.g. Sections 14.41 and 14.5. 1|) . we note that the three matrices 

/000\ /00l\ /0-10\ 

A"" = I -1 \ , Ay = I , A" = 1 (3.79) 
\010/ \-100/ \000/ 

span 30 (3), and generate the one-parameter subgroups 

/I \ / sin^ \ / cos^ -sin^ 

i?^= cos^ -sine , i?^= 1 , i?^ = sinO cosO 
\0 sin^ cos^ / \ -sin^ cos^ / \ 1 

(3.80) 

representing anticlockwise rotation around the respective coordinate axes in the phys- 
ical space IR^. 

Having computed so (3) to consist of antisymmetric matrices, we can use Section 
l3.2.1[ A's correspondence between these and vectors in IR^ so as to realize so(3) as 
vectors with the Lie bracket as vector multiplication. With these realizations in hand, 
we can readily obtain several further results about rotations. We will not need any. 
But a good example, which uses eq. I3.20f s isomorphism Q from vectors u G IR^ to 
matrices A G 30 (3), is as follows: — 

exp(r Q{uj)) is a rotation about the axis uj by the angle r || ||. 

We can now begin to see the point of this Chapter's second motto (from Arnold), 
that the elementary theory of the rigid body confuses six conceptually different three- 
dimensional spaces. For our discussion has already distinguished three of the six spaces 
which Arnold lists (in a different notation). Namely, we have just distinguished: 

(i) IR^, especially when taken as physical space; from (ii) 3o(3) = Te{S0{3)), the 
generators of rotations; though they are isomorphic as Lie algebras, by eq. l3.2(Jt s bi- 
jection from vectors u G IR^ to matrices A G 30 (3); 

(ii) 30 (3) = Te{S0{3)) from its isomorphic copy under the derivative of left trans- 
lation by g (i.e. under (i^c,)*), viz. Tg{S0{3)): cf. eq. 13.411 (In the motto, Arnold 
writes g for 3o(3) and G for S0{3).) 

In Section 15.2.41 we will grasp (even without developing the theory of the rigid 
body!) the rest of the motto. That is, we will see why Arnold also mentions the 



three corresponding dual spaces, ]R^*,so(3)* and T*{S0{3)). But we can already say 
more about the two tangent spaces 50 (3) = Te{S0{3)) and Tg{S0{3)), in connection 
with the idea that for a pivoted rigid body, the configuration space can be taken as 
5*0(3); (cf. (3) of Section f2.2|) . We will show that there are two isomorphisms from 
Tg{S0{3)) to Te{S0{3)) that are natural, not only in the mathematical sense of being 
basis-independent but also in the sense of having a physical interpretation. Namely, 
they represent the computation of the angular velocity from the Lagrangian generalized 
velocity, i.e. q. In effect, one isomorphism computes the angular velocity's components 
with respect to an orthonormal frame fixed in space (called spatial coordinates); and the 
other computes it with respect to a frame fixed in the rigid body {body coordinates) . In 
fact, these isomorphisms are the derivatives of right and left translation, respectively; 
(cf. eq. EMandEM- 

So suppose a pivoted rigid body has a right-handed orthonormal frame {a, b, c} fixed 
in it. We can think of the three unit vectors a, b, c as column vectors in JR^. Arranging 
them in a 3 X 3 matrix g := {a b c) & GL{3, IR), we get a matrix that maps the unit x- 
vector ei to a, the unit vector 62 to b, etc. That is: g maps the standard frame ei, 62, 63 
to a, b, c, and g is an orthogonal matrix: g G 5*0(3) = {g E GL{3, JR) \ gg = I }. Thus 
g represents the configuration of the body, and the configuration space is 5*0(3). 

By differentiating the condition gg = I, we deduce that the tangent space at a 
specific g Tg{S0{3)), i.e. the space of velocities g, is the 3-dimensional vector subspace 
of GL(3,1R): 

Tg(S0(3)) = {ge GL(3, IR) | ~g9 + ~99 = } (3.81) 

Now recall examples (ii) and (iii) of Section [3.2. ll A. We saw there that though the 
angular velocity of the body is usually taken to be the vector uj such that, with our 
"body-vectors" a, b, c, 

d = ti;Aa, b = uj Ab, c = u Ac : (3.82) 

we can instead encode the angular velocity by the antisymmetric matrix A := 0(u;) G 
Q = Te(50(3)). As we saw, eq. 13.821 then becomes 

d = e{uj)a, b = Q{uj)b, c = Q{uj)c : (3.83) 

or equivalently the matrix equation for the configuration g = {a b c), 

g^^abc) = e{uj)g ; i.e. e{uj) = gg-' . (3.84) 

Thus we see that the map from Tg{S0{3)) to g = Te(5*0(3)) 

g G T,(50(3)) ^ gg~' = gg e g (3.85) 

maps the generalized velocity g to the angular velocity Q{u). This is the angular 
velocity represented in the usual elementary way, with respect to coordinates fixed in 
space. One immediately checks that it is an isomorphism (exercise!). 

On the other hand, let us consider Q{uj) as a linear transformation Q{uj) : IR^ IR^, 



and express it in the body coordinates a,b,c. This gives g ^Q{uj)g = g ^g. Thus the 
map 

g E Tg{S0{3)) ^ g~'g = ~gg e Q (3.86) 

maps the generahzed velocity g to the angular velocity expressed in body coordinates. 
It also is clearly an isomorphism. 

Summing up: we have two natural isomorphisms that compute the angular velocity, 
in spatial and body coordinates respectively, from the generalized velocity g. 

Incidentally, one can verify directly that the images gg and gg of the isomorphisms 
eq. 13.851 and 13.861 lie in g, i.e. are antisymmetric matrices. Thus with ■ for the 
elementary dot-product, we have: 

(a \ a ■ b a ■ c \ 

b \ {db c) = I b- d 6-c . (3.87) 
c J \ c-d c-b 0/ 

This is an antisymmetric matrix, since differentiating a-b = b- c = a- c = with 
respect to time gives a ■ b + d ■ b = etc. Finally, we deduce that gg is antisymmetric 
from the facts that gg = g{g^^g)g^^ and antisymmetry is preserved by conjugation by 
9- 

We end this Subsection with two incidental remarks; (they will not be used in what 
follows) . 

(1) : In Section f2.1.1l we could have specialized the discussion from a symplectic 
manifold to a symplectic vector space, i.e. a (real, finite-dimensional) vector space 
equipped with a non-degenerate anti-symmetric bilinear form u : Z x Z ^ JR. It 
follows that Z is of even dimension. The question then arises which linear maps 
A : Z Z preserve the normal form of u given by eq. 12.41 It is straightforward to 
show that this is equivalent to A preserving the form of Hamilton's equations (for any 
Hamiltonian) ; so that these maps A are called symplectic (or canonical, or Poisson). 
The set of all such maps form a Lie group, the symplectic group, written Sp{Z,u). 
But since this Chapter will not need the theory of canonical transformations, I leave 
the study of Sp(Z, u;)'s structure as an exercise! (For details, cf. e.g. Abraham and 
Marsden (1978: 167-174), Marsden and Ratiu (1999: 69-72, 293-299).) 

(2) : Finally, a glimpse of the infinite-dimensional manifolds that this Chapter has 
foresworn. Consider the infinite-dimensional Lie group Diff{M) of all diffeomorphisms 
on M. An element of its Lie algebra, i.e. a vector A G Te{Dif f{M)), is a vector field, 
or equivalently a flow, on M. Besides, the Lie bracket in this Lie algebra Te{Dif f{M)), 
as defined by eq. 13.451 turns out to be the usual Lie bracket of the vector fields on M, 
as defined in Section 13.2.21 



4 Actions of Lie groups 

We turn to actions of Lie groups on manifolds. The notions, results and examples 
in this Section will be crucial from Section 15.41 onwards. Fortunately, the foregoing 



provides several examples of the notions and results we need. Section H^Tl will give basic 
material, including the crucial notion of cotangent lifts. Sections 14.21 and 14.31 describe 
conditions for orbits and quotient spaces to be manifolds. Section I^Til describes actions 
infinitesimally, i.e. in terms of their infinitesimal generators. Section presents two 
important representations of a Lie group, its adjoint and co-adjoint representations, 
on its Lie algebra g and on the dual g* respectively. Finally, Section 14.61 gathers some 
threads concerning our central, recurring example, viz. the rotation group. 

4.1 Basic definitions and examples 

A left action of a Lie group G on a manifold M is a smooth map ^ : G x M —>■ M 
such that: 

(i) : $(e, x) = X for all x G M 

(ii) : ^{g, x)) = ^{gh, x) for all g,h e G and all x e M. 
We sometimes write g ■ x for ^{g, x). 

Similarly, a right action of a Lie group G on a manifold M is a smooth map 
\l/ : M X G ^ M satisfying (i) e) = x and (ii) \l>(\l>(x, gf), /i) = '^{x,gh). We 
sometimes write x ■ g for \I^(a;,5'). 

It is convenient to also have a subscript notation. For every g & G, we define 

^g-.M^M : x^^g,x). (4.1) 

In this notation, (i) becomes $6 = idM and (ii) becomes = $g o For right 
actions, (ii) becomes "^gh = o vl/^. 

One immediately verifies that any left action $ of G on a manifold M, g ^-^ ^g : 
M M, defines a right action \1/ by 

^ := <|)^_i : M ^ M ; i.e. ^ : {x, g) e M x G ^ ^{g^^, x) E M . (4.2) 

(Use the fact that in G, {gh)~^ = h^^g^^.) Similarly, a right action defines a left action, 
by taking the inverse in G. We will occasionally make use of this left-right "flip" . 

The definition of left action is equivalent to saying that the map g ^ ^g is a 
homomorphism of G into Diff(M), the group of diffeomorphisms of M. In the special 
case where M is a Banach space V and each ^g : V ^ V is a. continuous linear 
transformation, the action of G on is called a representation of G on V. 

The orbit of x E M (under the action $) is the set 

Orb(a;) = {^g{x) : g E G} C M. (4.3) 

The action is called transitive if there is just one orbit, i.e. for all x,y E M, there is a 
g E G such that g ■ x = y. It is called effective (or faithful) if ^g = id^/ implies g = e, 
i.e. if g ^ ^g is one-to-one. It is called free if it has no fixed points for any g ^ e: that 
is, ^g{x) = X implies g = e. In other words, it is free if for each x E M, g \—>- ^g{x) is 
one-to-one. (So: every free action is faithful.) 

.c^7 



4.1. A Examples; cotangent lifts We begin witli geometric examples; and then 
return to mechanics, giving first some general theory, followed by some examples. 

(1): Geometric examples: — 

(1) : S0{?)) acts on IR^ by {A^x) ^ Ax. The action is faithful. But it is neither free 
(each rotation fixes the points on its axis) nor transitive (the orbits are the spheres 
centred at the origin). 

(ii) : GL(n, H) acts on IR" by (A, x) t— > Ax. The action is faithful, not free, and 
"almost transitive": the zero subspace {0} is an orbit, and so is — {0}. 

(iii) : Suppose X is a vector field on M which is complete in the sense that the fiow 
4>x{t) of eq. 13.71 is defined for all r G IR. Then this fiow defines an action of IR on M. 

We turn to two examples which will be central, and recurring, in our discussion of 
symplectic reduction. 

(iv) : Left translation by each g E G, Lg : h E G gh E G (cf eq. I3.39p . defines a 
left action of G on itself. Since G is a group, it is transitive and free (and so faithful). 
Similarly, right translation, g \—>- Rg with Rg : h E G ^ hg E G, defines a right action. 
And g h-* -Rg-i defines a left action; cf. eq. 14.21 

One readily proves that left translation lifts to the tangent bundle TG as a left 
action. That is: one verifies by the chain rule that 

<l>g:TG^TG : v = v^, E ^G ^ {nLg){v) E Tgj^G (4.4) 

defines a left action on TG. Similarly, right translation lifts to a right action on 
TG. But our interest in Hamiltonian mechanics of course makes us more interested 
in cotangent lifts. See (2) below for the general definitions, and example (viii) in (3) 
below for the cotangent lift of left translation. 

(v) : G acts on itself by conjugation (inner automorphism): g ^-^ Kg := Rg-i o Lg. 
That is: Kg : h E G ^ ghg~^ E G. Each Kg is an isomorphism of G. The orbits 
are conjugacy classes. Section 1^31 will introduce two "differentiated versions" of action 
by conjugation, viz. the adjoint and co-adjoint actions, which will be important in 
symplectic reduction. 

(2) : Hamiltonian symmetries and cotangent lifts: — 

We turn to Hamiltonian mechanics. Following the discussion in Section 12.1.31 we say: 
given a Hamilton system (M, u, H) with (M, a;) a symplectic manifold and H : M 
IR, a Hamiltonian group of symmetries is a Lie group G acting on M such that each 
$g : M — ^ M preserves both uj and H. Then the simplest possible examples are spatial 
translations and-or rotations acting on the free particle. The details of these examples, 
(vi) and (vii) below, will be clearer if we first develop some general theory. 

This theory will illustrate the interaction between the left-right contrast for actions, 
and the tangent-cotangent contrast for bundles. Besides, both the general theory and 
the examples' details will carry over straightforwardly, i.e. component by component, 
to the case of N particles interacting by Newtonian gravity, discussed in Section r2. 3. 21 
the action defined on a single particle is just repeated for each of the N particles. 

So we will take M := (IR^) x (IR^)*, to := dq' A dp\ H := p^ /2m. In the first place. 



both translations (by x G IR'^) and rotations (by A G 5*0(3)) act on the configuration 
space Q = H^. We have actions of IR^ and 5*0(3) on JR^ by 

'^'x(q) = q + X ; <l>^(q) = Aq . (4.5) 

But these actions lift to the cotangent bundle T*Q = (IR^) x (H^)* ^ IR^; (as 
mentioned in Section [23121) • The lift of these actions is defined using a result that does 
not use the notion of an action. Namely: 

Any diffeomorphism f '■ Qi —>■ Q2 induces a cotangent lift T*f : T*Q2 
T*Qi (i.e. in the opposite direction) which is symplectic, i.e. maps the 
canonical one- form, and so symplectic form, on T*Q2 to that of T*Qi. 

To define the lift of an action, it is worth going into detail about the definition of T*f. 
(But I will not prove the result just stated; for details, cf. Marsden and Ratiu (1999: 
Section 6.3).) 

The idea is that T*f is to be the "pointwise adjoint" of the tangent map Tf : 
TQi TQ2 (eq. 13. That is: we define T* f in terms of the contraction of its value, 
for an arbitrary argument a G T*^Q2, with an arbitrary tangent vector v G Tf-i(^q^^Qi. 
(Here it will be harmless to (follow many presentations and) confiate a point in T*Q2, 
i.e. strictly speaking a pair {q2,a),q2 G Q2,C( G T*^Q2, with its form a. And similarly 
it will be harmless to confiate a point {qi,v) in TQi with its vector v G Tq^Qi.) 

We recall that any finite-dimensional vector space is naturally, i.e. basis-independently, 
isomorphic to its double dual: {V*)* = V; and we will use angle brackets < ; > for 
the natural pairing between V and V*. So we define T*f; T*Q2 T*Qi by requiring: 

< {TJ){ay,v > := < a;{Tf){v) >, V a G T^Qs, G T^-i(,,)Qi . (4.6) 

NB: Because T*f "goes in the opposite direction" , the composition of lift with function- 
composition involves a reversal of the order. That is: if Qi = Q2 = Q and /, g are two 
diffeomorphisms of Q, then 

T*{fog)=T*goT*f. (4.7) 

With this definition of T*/, a left action $ of G on the manifold Q induces for each 
g & G the cotangent lift of : Q ^ Q. That is: we have the map 

T*^g = T*{^g) : T*Q T*Q, with a G T;Q ^ (r*<l>3)(«) G T;-i . . (4.8) 

Now consider the map assigning to each g E G, T*^g\ 

geG^ T*^g : T*Q T*Q . (4.9) 

To check that this is indeed an action of G on T*Q, we first check that since $e = idg, 
T$e : TQ — > TQ is idrq and T*($e) is idT*Q- But beware: eq. 14.71 vields 

T*^g^, = T*($3 o $^) = T*<l>^ o T*^g , (4.10) 



so that eq. 14.91 defines a right action. 

But here we recall that any left action defines a right action by using the inverse; 
cf. eq. 14.21 Combining this with the idea of the cotangent lift of an action on Q, we 
get: 

The left action $ on Q defines, not only the right action eq. 14.91 on T*Q, but also 
a left action on T*Q, viz. by 

g eG^^g:=T*{^g-i) ■.T*Q ^T*Q . (4.11) 

For since (gh)^^ = h^^g^^, 

= = T*(<l>,-i,-i) = T*($,-i o = T*V^ ° T*<!>f,-i = o . 

(4.12) 

In short, the two reversals of order cancel out. This sort of left-right flip will recur in 
some important contexts in the following, in particular in Sections 16.51 and [7| 

(3): Mechanical examples: — 
So much by way of generalities. Now we apply them to translations and rotations of a 
free particle, to rotations of a pivoted rigid body, and to point-particles. 

(vi): Let the translation group G = (IR^, -|-) act on the free particle's configuration 
space Q = IR^ by 

$,(q)=q + x. (4.13) 

Since G is abelian, the distinction between left and right actions of G collapses. (And 
if we identify G with Q, this is left=right translation by IR^ on itself, i.e. example (iv) 
again: and so transitive and free.) But of course the lifted actions we have defined, 
"with g" and "with g~^" , eq. 14.91 and [4.1 II respectivelv. remain distinct actions. 

Then, writing a = (q, p) G T*Q, and using the fact that T$x(q — x, q) = (q, q), 
we see that eq. 14.61 implies that: first, 

T*(<fx)(q,p)GTq*_,Q; (4.14) 

and second, that for all q G Tq_x<5, 

<T*($x)(q,p);(q-x,q) > = < (q,p);(q,q)> = p(q) • (4.15) 

For eq. 14. 151 to hold for all q G Tq_x<5 requires that T*($x)(q, p) does not affect p, i.e. 

T*($x)(q,p) = (q-x,p). (4.16) 

So this is the lifted action "with g" , corresponding to eq. 14.91 Similarly, the lifted action 
"with g~^'\ corresponding to eq. 14.111 is: \E'x(q, p) := r*('&-x)(q, p) = (q + x, p). 

One readily checks that these lifted actions preserve both uj = dq^ A dp* (an exercise 
in manipulating the exterior derivative) and H := /2m. So we have a Hamiltonian 
symmetry group. The action is not transitive: the orbits are labelled by their values 
of p G (IR^)*. But it is free. 

fin 



(vii): Let SO (3) act on the left on Q = by 



<l>A(q) = v4q. (4.17) 

(This is example (i) again.) Let us lift this action "with g" , i.e. eq. 14.91 so as to get a 
right action on T*Q. 

As in example (vi), we write a = (q, p) G T*Q. Using the fact that T$A(q, q) = 
{Aq,Aq), eq. 14.61 then implies that: first, 

T*(<fA)(q,p)GT;_,qQ; (4.18) 

and second, that for all q G T^-iqQ, 

<T*($^)(q,p);(A-iq,q) > = < (q, p); (q, Aq) > = p(Aq) = PiA)q^ . 

(4.19) 

For eq. 14.191 to hold for all q G T^-iqQ requires that 

T*(<|.^)(q,p) = (A-^q,p/l), (4.20) 

where pA is a row-vector. Or if one thinks of the p components as a column vector, it 
requires: 

r*($^)(q, p) = (A-iq, ip) = (A-iq, ^-^p) , (4.21) 

where ~ represents the transpose of a matrix, and the last equation holds because A is 
an orthogonal matrix. 

So this is the lifted action "with g''\ corresponding to eq. 14.91 Similarly, the 
lifted action "with g~^'\ corresponding to eq. 14.111 is: \l'A(q, p) := T*($^-i)(q, p) = 
(^q, Ap). 

Again, one readily checks that these lifted actions preserve both uo = dq^ A dp^ 
(another exercise in manipulating the exterior derivative!) and H := /2m. So 5*0 (3) 
is a Hamiltonian symmetry group. 

Like the original action of S'0(3) on Q, these actions are faithful. But they are not 
transitive: the orbits are labelled by the radii of two spheres centred at the origins of 

and (IR^)*. And they are not free: suppose q and p are parallel and on the axis of 
rotation of A. 

(viii): Now we consider the pivoted rigid body. But unlike examples (vi) and (vii), 
we will consider only kinematics, not dynamics: even for a free body. That is, we will 
say nothing about the definitions of, and invariance of, uj and if; for details of these, 
cf. e.g. Abraham and Marsden (1978: Sections 4.4 and 4.6) and the other references 
given in (3) of Section IT^ We will in any case consider the dynamics of this example 
in more general terms (using momentum maps) in Sections 16.5.31 and [7| 

We recall from the discussion at the end of Section 13.4.41 that the configuration 
space of the pivoted rigid body is 5*0(3) =: G. We also saw there that the space and 
body representations of the angular velocity v = g E TgG are given by right and left 
translation. Thus eq. 13.851 and 13.861 give: 

= g'' ■.= TgRg-i{g) and v"" = g"" := TgLg-.{g) . (4.22) 

fil 



But we are now concerned with the cotangent hft of left (or right) translation. So let 
5'0(3) act on itself by left translation: ^gh = Lgh = gh. Let us lift this action "with g" , 
i.e. eq. 14.91 to get a right action on T*G. So let a G T^G and (TLg){h, h) = {gh,gh). 
Then eq. 14.61 implies that: first 

(T*L,)(a)Gr;_.,G, (4.23) 

and second that for all v G Tg-i^G 

< T*{Lg){a)]v > = < a;gv > . (4.24) 

In other words, on analogy with eq. I4.16l and l4.20l for eq. l4.24l to hold for all v G Tg-i^G 
requires that with gv G ThG: 

T*{Lg){a) : v G Tg-ij,G ^ a{gv) . (4.25) 

Similarly, the lifted action "with (7"^" corresponding to eq. 14.111 i.e. the left action on 
T*G, is 

< T*{Lg-i){a)]v > = < a;g-\ > ,Wa E UG,v E TghG (4.26) 

We will continue this example in Section 14.61 after developing more of the theory of 
Lie group actions. 

Finally, let us sketch another mechanical example: the case of particles with 
configuration space Q := IR^^ interacting by Newtonian gravity — discussed in Section 
12.3.21 This will combine and generalize examples (vi) and (vii); and lead on to the 
next Sections' discussions of orbits and quotients. 

(ix): As I mentioned above (before eq. 14. 5|) . the cotangent-lifted actions of trans- 
lations and rotations on a single particle carry over straightforwardly to the case of 

particles: the action defined on a single particle is just repeated, component by 
component, for each of the A^ particles to give an action on T*Q = IR^^ X (IR^^^)*. 

Furthermore, the groups of translations and rotations are subgroups of a single 
group, the Euchdean group E. I shall not define E exactly. Here, let it suffice to say 
that: 

(a) : E^s component-wise action on the configuration space Q := IR^^ has a cotan- 
gent lift, which is of course also component by component. 

(b) : £"s cotangent-lifted action is not transitive, nor free; but it is faithful. 

(c) : If we take as the Hamiltonian function the H of eq. 12.251 describing the parti- 
cles as interacting by Newtonian gravity, then ii^ is a Hamiltonian symmetry group. In 
fact, the kinetic and potential energies are separately invariant, essentially because the 
particles' interaction depends only on the inter-particle distances, not on their positions 
or orientations; cf. the discussion in Section [2.3.21 

A final comment about example (ix), which points towards the following Sections: — 
Recall that in Sections 12.3.31 and 12.3.41 we used this example as a springboard to 
discussing Relationist and Reductionist procedures, which quotiented the configuration 
space or phase space. But in order for the quotient spaces (and orbits) to be manifolds, 
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and in particular for dimensions to add or subtract in a simple way, we needed to excise 
two classes of "special" points, before quotienting. These were: the class of symmetric 
configurations or states (i.e. those fixed by some element of E), and the class of 
collision configurations or states. For the quotienting of phase space advocated by 
Reductionism, the classes of states were S C T*JR^^ and A C T*JR^^; (cf. Section 
12.3.41 for definitions.) 

With examples (vi) to (ix) in hand, we can now see that: 

(a) : 5 and A are each closed under the cotangent-lifted action of E on T*1R^^; 
i.e., each is a union of orbits. So E acts on M := T*1R'^^ — (5 U A). 

(b) : E acts freely on M. 

We will see in the sequel (especially in Sections I4.3I B and l5.5|) that an action being free 
is one half (one conjunct) of an important sufficient condition for orbits and quotient 
spaces to be manifolds. The other conjunct will be the notion of an action being proper. 
which we will define in Section 1^751 



4.2 Quotient structures from group actions 

In finite dimensions, any orbit Orb(x) is an immersed submanifold of M. This can 
be proved directly (Abraham and Marsden (1978: Ex. 1.6F(b), p. 51, and 4.1.22 p. 
265)). But for our purposes, this is best seen as a corollary of some conditions under 
which quotient structures are manifolds; as follows. 

The relation, x = y if there is a (? G G such that g-x = y, is an equivalence relation, 
with the orbits as equivalence classes. We denote the quotient space, i.e. the set of 
orbits, by M/G (sometimes called the orbit space). We write the canonical projection 
as 

71 : M ^ M/G, X ^ Orb(a;) ; (4.27) 

and we give M/G the quotient topology by defining U C M/G to be open iff n^^{U) 
is open in M. 

Simple examples (e.g. (ii) of Section UTT] A) show that this quotient topology need 
not be Hausdorff. However, it is easy to show that if the set 

R ■= {(x, <^gx) eM X M : {g,x) EG X M} (4.28) 

is a closed subset of M x M, then the quotient topology on M/ G is Hausdorff. 

But to ensure that M/G has a manifold structure, further conditions are required. 
The main one (and a much harder theorem) is: 

R is a closed submanifold of M x M iff M/ G is a manifold with tt : M — 
M/G a submersion. 

This theorem has two Corollaries which are important for us. 

(1): A map h : M/G N, from the manifold M/G, for which n : M M/G is a 
submersion, to the manifold N, is smooth iff /i o tt : M ^ is smooth. 



This corollary has a useful implication, called passage to the quotients, about the 
notion of equivariance — which will be important in symplectic reduction. 

A smooth map / : M — > is called equivariant if it respects the action of a Lie 
group G on the manifolds. That is: Let G act on M and N hj : M ^ M and 

: N ^ N respectively. / : M — > is called equivariant with respect to these 
actions if for all g & G 

/ o $^ = v[/^ o /. (4.29) 
That is, / is equivariant iff for all g, the following diagram commutes: 

M N (4.30) 



*9 



*9 



M N 

Equivariance immediately implies that / naturally induces a map, / say, on the quo- 
tients. That is: the map 

/ : Orb(x) G M/G ^ Orb(/(x)) G N/G (4.31) 

is well-defined, i.e. independent of the chosen representative x for the orbit. 

Applying the corollary we have: If / : M ^ is equivariant, and the quotients 
M/G and M/N are manifolds with the canonical projections both submersions, then 
/ being smooth implies that / is smooth. This is called passage to the quotients. 

(2): Let if be a closed subgroup of the Lie group G. (By (2) of Section 13.4.31 this 
is equivalent to H being a subgroup that is a submanifold of G.) Let H act on G by 
left translation: {h, g) E H x G ^ hg E G, so that the orbits are the right cosets Hg. 
Then G/H is a. manifold and n : G ^ G/H is a submersion. 



4.3 Proper actions 

By adding to the Section H?^ s main theorem (i.e., i? is a closed submanifold of M x M 
iff M/G is a manifold with n : M —>■ M/G a submersion), the notion of a proper action 
we can give useful sufficient conditions for: 

(A) : orbits to be submanifolds; 

(B) : M/G to be a manifold. 

An action $ : G x M — > M is called proper if the map 

^ : {g,x) e G X M ^ (x, <l>(^, x)) G M x M (4.32) 

is proper. By this we mean that if is a convergent sequence in M, and {$g„(a;„)} 
is a convergent sequence in M, then {gn} has a convergent subsequence in G. In 
finite dimensions, this means that compact sets have compact inverse images; i.e. if 
K G M X M is compact, then ^~^{K) is compact. 

If G is compact, this condition is automatically satisfied. Also, the action of a 



group on itself by left (or by right) translation (Example (iv) of Section I4.11 A) is 
always proper. Furthermore, the cotangent lift of left (or right) translation ((2) and 
Example (viii) of Section I4.1I A) is always proper. We shall not prove this, but it will 
be important in the sequel. 

4. 3. A Isotropy groups; orbits as manifolds For x G M the isotropy (or stabilizer 
or symmetry) group of $ at x is 

G,:={geG: $,(x) = ^g, x) = x} C G. (4.33) 

(So an action is free iff for all x G M, Gx = {e}.) 
So if we define 

<I>^':G^M : $^(c/) := $(c/,x) (4.34) 

we have: Gx = ($^)^^(x). (The notation $^ is a "cousin" of the notation $g defined 
in eq. 14.11 ) 

So since is continuous, Gx is a closed subgroup of G. So, by the result in (2) 
of Section 13.4.31 fi.e. the result before eq. I3.69p . Gx is a submanifold (as well as Lie 
subgroup) of G. And if the action is proper, Gx is compact. 

Furthermore, the fact that for all h G Gx we have ^^{gh) = $g o ^^{x) = ^g{x), 
implies that $^ naturally induces a map 

: [g] = gGx G G/Gx ^ <^gX G Orb(a;) C M . (4.35) 

That is, this map is well-defined. is injective because if = then g^^h G Gx, 
so that gGx = hGx- 

It follows from Section l^^ s main theorem (i.e., i? is a closed submanifold of M x M 
iff M/G is a manifold with vr : M — > M/G a submersion) that: 

(a) : If $ : G X M — i> M is an action and x G M, then defined by eq. 14.351 is an 
injective immersion. 

Here we recall from Section 13. 3. II that injective immersions need not be embeddings. 
But:— 

(b) : If also $ is proper, the orbit Orb(x) is a closed submanifold of M and is 
a diffeomorphism. In other words: the manifold structure of Orb(x) is given by the 
bijective map [g] G G/Gx ^ g ■ x E Orb(x) being a diffeomorphism. 

Examples: — 

(We use the numbering of corresponding examples in Section UTTl A): — 

(i): G = S0{3) acts on M = by {A,x) i-^ Ax. Since Orb(x) is a sphere centred 
at the origin of radius || x ||, M/G = which is not a manifold. But results (a) and 
(b) are illustrated: the isotropy g roup Gx 

X is the group of rotations with x on the 
axis; the action is proper (for G is compact); the orbit Orb(x) is a closed manifold of 
M; and the isotropy group's cosets [g] G G/Gx are mapped diffeomorphically by to 
points on the sphere Orb(x). 

(iii): Let X be the constant vector field dx on M = IR^. X is complete. The action 



of ]R on M has as orbit through the point x = {x, y, z) G H^, the hne y = constant, z = 
constant. The action is free, and therefore faithful and the isotropy groups are triviaL 
So G/Gx = G. The action is proper. Again resuhs (a) and (b) are illustrated: the 
orbits Orb(x) are closed submanifolds of M, viz. copies of the real line = G = G/Gx 
that are diffeomorphic to IR by 

4.3.B A sufficient condition for the orbit space M/G to be a manifold With 
result (b) from the end of Section 1^751 A., we can prove that: 

If $:G'xM^M is a proper free action, then the orbit space M/G is a 
manifold with vr : M — M/G a submersion. 

Examples: (again using the numbering in Section I^ITI A): — 

(i): G = S0{3) acts on M = IR^ by {A,x) i— >• Ax. Since Orb(x) is a sphere centred 
at the origin of radius || x ||, M/G = IR^: which is not a manifold, and indeed the 
action is not free. 

(iii) : Let X be the constant vector field on M = JR^. X is complete, and the 
action of IR on M has as orbits the lines y = constant, z = constant. The action is 
faithful, free and proper, so that the orbit space M/G is a manifold: M/G = IR^. 

(iv) : Left (or right) translation is obviously a free action of a group G on itself, and 
we noted above that it is proper. But since it is transitive, the orbit space G/G is the 
trivial 0-dimensional manifold (the singleton set of G). 

(viii) : The cotangent lift of left (or right) translation by 5*0(3), or more generally, 
by a Lie group G. This action is proper (noted after eq. I4.32|) . and obviously free. 

(ix) : The Euclidean group E acts freely on M := T*1R^^ — (5 U A). This action is 
also proper: a (harder!) exercise for the reader. 

4.4 Infinitesimal generators of actions 

We now connect this Subsection's topic, group actions, with the Lie algebra of the Lie 
group concerned, i.e. with the topic of Section IT^ especiallv 13.4.21 

Let $:G'xM— s>Mbea (left) action by the Lie group G on a manifold M. Then 
each ^ G defines an action of IR on M, which we write as in the following way. 

We can think either in terms of exponentiation of i^'s corresponding left-invariant 
vector field (cf. eq. 13.81 and l3.47|) : or in terms of of exponentiating ^ itself (cf. eq. 
EUniandEini): 

$^ : IR X M ^ M : $^(r, x) := <l>(exp(rX5), x) = $(exp(rO, x). (4.36) 

That is, in terms of our subscript notation for the original action $ (cf. eq. 14.11) : 

<^'exp(rx^) = <^'oxp(rO : M ^ M Is a flow ou M. 

That the flow is complete, i.e. that an action of all of IR is deflned, follows from 
(2) Exponentiation again of Section 13.4.21 especially after eq. 13.481 Cf. also example 
(iii) of Section H?TJ 

fifi 



We say that the corresponding vector field on M, written ^m, i-e. the vector field 
defined at x G M by 



Cm{x) := — 1^=0 ^cMrXi.){x) = — \r=0 <^c^p(ro{x) (4.37) 

is the infinitesimal generator of the action corresponding to ^. 

In terms of the map defined in eq. 14.341 we have that for all ^ G g 

^m{x) = {Te^mO ■ (4.38) 

So NB: the words 'infinitesimal generator' are used in different, though related, 
ways. In Remark (2) at the end of Section I3.4.2| a vector field on the group G, or an 
element ^ G g, was called an 'infinitesimal generator'. Here the infinitesimal generator 
is a vector field on the action-space M. Similarly, beware the notation: is a vector 
field on M, while is a vector field on G. 

As an example, we again take the rotation group S0{3) acting on IR'^: {A,x) G 
S0{3) X IR Ax. One readily checks that with u G IR^, so that 6(a;) G so(3), the 
infinitesimal generator of the action corresponding to ^ = <d{uj) is the vector field on 
IR^ 

eH3(x)^(eH)i,3(x)=a;Ax. (4.39) 

In particular, the vector field on IR'^ representing infinitesimal anti-clockwise rotation 
about the x-axis is ei := ydz — zdy (cf. eq. I3.79|) . Similarly, the infinitesimal generators 
of the action of rotating about the y axis and about the z-axis are, respectively: 62 : = 
zdx — xdz and 63 := xdy — yd^. The Lie brackets are given by: 

[61,62] = -63 [63,61] = -62 [62,63] = -61. (4.40) 

The minus signs here are a general feature of the transition ^ G g 1-^ S,m ^ '^(^); cf. 
result (4) below. 

As another example, we take the infinitesimal generator of left and right transla- 
tion on the group G. (We will need this example for our theorems about symplectic 
reduction; cf. Sections 16.5.31 17^ and 17^3.31 ) NB: There will be a "left-right fiip" here, 
which continues the discussion in (4) of Section [3. 4. 2[ B. comparing using left-invariant 
vs. right-invariant vector fields to define the Lie algebra of a Lie group. 

For left translation ^{g,h) = Lgh := gh, we have for all ^ G g: 

$«(r,/i) = {exprOh = i?,(exprO ; (4.41) 

so that the infinitesimal generator is 

^aig) = {TeRg)i . (4.42) 

So is a ng'/it-invariant vector field; and unless G is abelian, it is not equal to the 
left-invariant vector field g t— > X^{g) := {TeLg)^; cf. eq. l3.4Ul and 13.421 
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Similarly, for right translation (which is a right action, cf. (1) (iv) in Section lTTl A). 
the infinitesimal generator is the left-invariant vector field 



g ^ X^{g) := (T^L^)^ . 



(4.43) 



Three straightforward results connect the notion of an infinitesimal generator with 
previous ideas. I will not give proofs, but will present them in the order of the previous 
ideas. 

(1): Recall the correspondence between Lie subgroups and Lie subalgebras, at the 
end of Section 13.4.31 eq. 13.691 This implies that the Lie algebra of the isotropy group 
Gx,x & M (called the isotropy algebra), is 



(2) : Infinitesimal generators give a differential version of the notion of equivari- 
ance, discussed in (1) of Section a version called infinitesimal equivariance. 

In eq. 14.291 we set g = exp(r^) and differentiate with respect to r at r = 0. This 
gives Tf o = ° f ■ That is: and ^jy are /-related. In terms of the pullback /* 
of /, we have: f*^N = 6/- 

(3) : Suppose the action $ is proper, so that by result (b) at the end of Section 
I4.31 A: the orbit Orb(x) of any point x G M is a (closed) submanifold of M. Then the 
tangent space to Orb(x) at a point y in Orb(x) is 



Finally, there is a fourth result relating infinitesimal generators to previous 
ideas; as follows. (But it is less straightforward than the previous (l)-(3): its proof 
requires the notion of the adjoint representation, described in the next Section.) 

(4): The infinitesimal generator map C, ^— ^ establishes a Lie algebra anti- 
homomorphism between g and the Lie algebra Xm of all vector fields on M. (Contrast 
the Lie algebra isomorphism between q and the set Xl{G) of left-invariant vector fields 
on the group G; Section 13.4.21 especiallv eq. 13.421 ) That is: 

(a^ + br])M = <M + bT]M ; [^m, Vm] = -[^^ v]m ^7 G 0, and a,b eH. (4.46) 

Incidentally, returning to (4) of Section 13.4.21 B. which considered defining the Lie 
algebra of a Lie group in terms of right-invariant vector fields, instead of left-invariant 
vector fields: had we done so, the corresponding map ^ i— > C,m would have been a Lie 
algebra homomorphism. 

4.5 The adjoint and co-adjoint representations 

A leading idea of later Sections (especially Sections l5.4116.41 andl7|) will be that there is 
a natural symplectic structure in the orbits of a certain natural representation of any 



0x = {e e : ^m{x) = 0} . 



(4.44) 



TOrb(a;), = {6,(2/) : ^ e g} . 



(4.45) 
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Lie group: namely a representation of the group on the dual of its own Lie algebra, 
called the co-adjoint representation. Here we introduce this representation. But we 
lead up to it by first describing the adjoint representation of a Lie group on its own 
Lie algebra. Even apart from symplectic structure (and so applications in mechanics), 
both representations illustrate the ideas of previous Subsections. I will again use S0{?>) 
and so (3) as examples. 

4.5.1 The adjoint representation 

We proceed in four stages. We first define the representation, then discuss infinitesimal 
generators, then discuss matrix Lie groups, and finally discuss the rotation group. 

(1): The representation defined: — 
Let G be a Lie group and g its Lie algebra, i.e. the tangent space to the group at the 
identity e G G, equipped with the commutator bracket operation [, ]. 

Recall (e.g. from the beginning of Section I3.4.2j) that G acts on itself by left and 
right translation: each g & G defines diffeomorphisms of G onto itself by 

Lg-.heG^gheG; Ry : h e G ^ hg e G. (4.47) 

The induced maps of the tangent spaces are, for each h E G: 

Lg, : TGh ^ TGgh and Rg, : TG^ ^ TGhg. (4.48) 

The diffeomorphism Kg := Rg~i o Lg (i.e. conjugation hj g, Kg : h ghg~^) is an 
inner automorphism of G. (Cf. example (v) at the end of Section I^TTl ) Its derivative 
at the identity e G G is a linear map from the Lie algebra g to itself, which is denoted: 

Adg := {Rg-i o Lg),e : g ^ 0. (4.49) 

So letting g vary through G, the map Ad : g ^ Adg assigns to each g a member of 
End(g), the space of linear maps on (endomorphisms of) g. The chain rule implies that 
Adgh = AdgAdh- So 

Ad:g^ Adg (4.50) 

is a left action, a representation, of G on g: G x g — g. It is called the adjoint 
representation. 

Three useful results about Ad follow from our results (1) and (3) in Section r3.4.2I B 
(cf. eq. 13.521 Homomorphisms respect exponentiation): 

[1]: If ^ G g generates the one-parameter subgroup H = {exp(rX^) : r G IR}, then 
Adg{^) generates the conjugate subgroup Kg{H) = gHg~^. 

exp{Adg{0) = KgiexpO ■= gi^wOg''- (4-51) 

Incidentally, eq. 14.511 has a many-parameter generalization. Let H and H' be two 
connected r-dimensional Lie subgroups of the Lie group G, with corresponding Lie 
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subalgebras f) and [)' of the Lie algebra g = g(G). Then H and H' are conjugate 
subgroups, H' = gHg^^, iff i) and i)' are corresponding conjugate subalgebras, i.e. 

[2] : Eq. 14.511 also implies another result which will be needed for a crucial result 
about symplectic reduction, in Section Hi. 5. 21 (The many-parameter generalization just 
mentioned will not be needed.) It relates Ad to the pullback of an arbitrary action $. 

Thus let $ be a left action of G on M. Then for every g E G and ^ G g 

{AdgOM = ^^-i^M , (4.52) 
where $* indicates pullback of the vector field. For we have: 

(AdgOMix) := ^ |,=o ^(expirAdgO^x) (4.53) 

= 4- \r=o '^'(^(exp T^)g'\ x) by eq. EM (4.54) 
dr 

= — \r=0 ($5 O $e^p^^ O $^-1 (x)) (4.55) 

= ^<i'g_i(z)'^'g(^Af('^'9-i(a;))) bv the chain rule and eq. 14.371 (4.56) 
= (^g-i^Af) (x) by the definition of pullback. (4-57) 

Not only is this result needed later. Also, incidentally: it is the main part of the proof 
of result (4) at the end of Section^31 that ^ i-^ is a Lie algebra anti-homomorphism. 

[3]: Adg is an algebra homomorphism, i.e. 

Adg[^,7]] = [Adg^,Adg7]] , ^,^^0. (4.58) 

(2): Infinitesimal generators: the map ad: — 
The map Ad is differentiable. Its derivative at e G G is a linear map from the Lie 
algebra g to the space of linear maps on g. This map is called ad, and its value for 
argument ^ G g is written ad^. That is: 

ad := Ad:,e ■ Q Endg ; ad^ = ^ \r=o -44xp{r0 (4.59) 

where exp(r,^) is the one-parameter subgroup with tangent vector ^ at the identity. 
But if we apply the definition eq. 14.371 of the infinitesimal generator of an action, to 
the adjoint action Ad, we get that for each ^ G g, the generator ^g, i.e. a vector field 
on g, is 

: r/ G g ^ ^g(r/) G g with ^3(7/) := — |^=o ^4xp(ro(^)- (4-60) 

Comparing eq. I4.59[ we see that ad^ is just the infinitesimal generator ^g of the adjoint 
action corresponding to ^: 

ad^ = . (4.61) 
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We now compute the infinitesimal generators of tlie adjoint action. It will be crucial 
to later developments (especially Section 15 .41) that these are given by the Lie bracket 
in g. 

We begin by considering the function y4(icxp(T0 (^) differentiated. By eq. I4.49t 
we have 

Ade^p(roi'n) = TeiRcxpi-rO ° ^cxp(r5) ) (4.62) 
= (T'exp(T0(-Rexp(-r0) ° ^e-^exp(rO ) (^) 

)-X,(exp(rO) 

where the second line follows by the chain rule, and the third by definition of left- 
invariant vector field. Writing the flow of as (prid) = fl'expr^ = Rexp(T()g, and 
applying the definition of the Lie derivative (eq. I3.17p . we then have 

^b(^) ■= ^ \r=0 v4ciexp(rO(^) = ^ [^0r(e)</'r ^ " -^r? (</'r (c) )] \r=0 (4-63) 

= [X^,X,]{e) = [^,v]- 

where the final equation is the definition eq. l3.45l of the Lie bracket in the Lie algebra. 

So for the adjoint action, the infinitesimal generator corresponding to ^ is taking 
the Lie bracket: rj i— >■ [C,,^]]- To sum up: eq. 14.591 and 14.601 now become 

ad = Ad^e ■ Endg ; ad^ = ^ \r=o Ade^p(rO = ■ V ^ Q ^ [^^v] ^ 5- (4-64) 

(3): Example: matrix Lie groups: — 
In the case where G C GL{n,]R) is a matrix Lie group with Lie algebra g C Ql{n), 
these results are easy to verify. Writing n x n matrices as A,B G G, conjugation is 
Ka{B) = ABA~^, and the adjoint map Ad is also given by conjugation 

AdA{X) = AXA-\ AeG,X EQ. (4.65) 

So with A{t) = exp(rX), so that A{0) = I and A'{0) = X, we have with Y e Q 

A l^^Q Ad^^prxY = ^ |,=o [A{r)YA{r)-'] (4.66) 
= A'{0)YA-\0) + A{0)YA-''{0). 
But differentiating A{t)A^^{t) = I yields 

^(A-\t)) = -A-\r)A'{T)A-\r), and so ^"^'(0) = -A'{0) = -X (4.67) 
dr 



so that indeed we have 
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4- \r=o Ad^^prxY = XY-YX = [X, Y]. (4.68) 
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(4): Example: the rotation group: — 
It is worth giving details for the case of G = 
13.4.41 (eq. I3.79p that the three matrices 



5*0(3), g = so (3). We saw in Section 



A"" = \ -1 \ , Ay = \ \ , A' = \ 1 (4.69) 
\010/ \-100/ \000/ 

span so (3), and generate the one-parameter subgroups 

/I \ / sin^ \ / cos^ -sine 

i?^= cos^ -sine 1 i?^= 1 , i?^ = sin^ cos^ 
\0 sine cose / \ -sine cose / \ 1 

(4.70) 

representing anticlockwise rotation around the respective coordinate axes in the physi- 
cal space IR^. To calculate the adjoint action of Rg on the generator A^, we differentiate 
the product RgK^R^g with respect to r and set r = 0. That is, we find 

(0 — sin e cos e \ 
sine = cose- + sine- (4.71) 
-cose / 

We similarly find 

AdR.iA'') = A"", Ad n.{A') = -sine ■ Ay + cose ■ A'. (4.72) 

So the adjoint action of the subgroup Rg representing rotations around the x-axis of 
physical space is given by rotations around the A^-axis in the Lie algebra space so (3). 
Similarly for the other subgroups representing rotations around the y or 2;-axis. And 
so for any rotation matrix R G 5*0(3), relative to given axes x,y, z for IR^, its adjoint 
map Adu acting on so (3) ^ IR^ has the same matrix representation relative to the 
induced basis {A^,Ay,A^} of so(3). (NB: This agreement between 5*0(3)'s adjoint 
representation and its natural physical interpretation is special to 5*0(3): it does not 
hold for other matrix Lie groups.) 

Finally, the infinitesimal generators of the adjoint action are given by differentiation. 
For example, using eq. 14. 7H we find that 

adA^A'') := ^ \e=o Ad^^Ay = A' ; (4.73) 

de * 

which agrees with the commutator: A^ = [A^,Ay]. 
4.5.2 The co-adjoint representation 

Again we proceed in stages. We first define the representation, then discuss infinitesi- 
mal generators, and then take the rotation group as an example. 
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(1) : The representation defined: — 

We recall that a linear map A : V —>■ W induces (basis-independently) a transpose 
(dual) map, written A* (or A or A^), A* : W* V* on the dual spaces, V* := {a : 
— >^ IR I a linear } and similarly for W*; by 

Va eW*,yveV : A*{a){v) = < A*{a) ; v > := a{A{v)) = {a o A){v) . (4.74) 

So any representation, IZ say, of a group G on a vector space V ,TZ : G End(K), 
induces a representation TZ* of G on the dual space V*, by taking the transpose. We 
shall call TZ* the dual or transpose of IZ] it is also sometimes called a 'contragredient 
representation'. That is: for TZ{g) -.V ^V,we define TZ*{g) -.V* ^V* by 

TZ*{g) -.aeV* ^ n*{g){a) := a{TZ{g)) E V* . (4.75) 

Thus the adjoint representation of G on g induces a co-adjoint representation of G 
on the dual q* of its Lie algebra q, i.e. on the cotangent space to the group G at the 
identity, q* = T*G. The co-adjoint representation will play a central role in symplectic 
reduction (starting in Section 1^7^ . 

So let Ad* : 0* ^ 0* be the dual (aka: transpose) of Adg, defined by 

Va G 0*, ^ G : < Ad*ga] ^ > := < a; Adg^ > . (4.76) 

Since Ad : g i— ^ Adg is a left action {Adg^ = AdgAdh), the assignment g i— >■ Ad* is a 
right action. So to define a left action, we use the inverse g~^] cf. eq. 14.21 and 14.111 
Namely, we define the left action 

{g, a)eGxg* ^ Ad*g-,a G 0* ; (4.77) 

called the co-adjoint action of G on 0*. And the corresponding co-adjoint representation 
of G on 0* is denoted by 

Ad* -.G^ End(0*), Ad*g^, = {Te{Rg o Lg-i))* . (4.78) 

(2) : The map ad* ; infinitesimal generators: — 

The map Ad* is differentiable. Its derivative at e G G is a linear map from the Lie 
algebra to the space of linear maps on 0*. This map is called ad*, and its value for 
argument ^ G is written ad'^. Thus ad^ is an endomorphism of 0*, and we have 

ad* = Adl^ : ^ G ^ arf^ G End0* . (4.79) 

Now recall our deduction from eq. 14.591 and I4.6UI that ad^ = ^g, i.e. eq. 14.611 In the 
same way we here deduce an equality to the infinitesimal generator of the co-adjoint 
action: 

adl = . (4.80) 
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In fact, aci^ is, modulo a minus sign, the adjoint of ad^, in the usual sense of the 
natural pairing of a vector space with its dual: as we now show. (So the notation ad* 
is justified, modulo a minus sign.) 

Let us compute for this action, the value of the infinitesimal generator ^g. (a vector 
field on q*, induced by ^ G g) at the point a G g*. That is, we will compute the value 
^g*(a). As usual, we identify the tangent space (Tg*)^ in which this value hves, with 
g* itself; and similarly for g. So, with ^g* acting on 77 G g, we compute: 

<adliay,7]> = <^,4ay,r]> = /^l^^Adl^^^^^{a)-rS (4.81) 



^ dr 

^ |r=0 (^Cp-r5(tt);^> = ^ |r=0 (a;-44xp-r5r/) (4.82) 



a; ^ |r=o Ade^p-r^r]^ = < a; -[^,1]] > = - < a;ad^{r]) > . (4.83) 

So ad^, defined as the derivative of Ad* is, up to a sign, the adjoint of ad^. 

(3): Example: the rotation group: — 
Let us now write the elementary vector product in as A, and identify so(3) = 
(H^, a) and 30 (3)* = IR'^ . And let us have the natural pairing given by the elementary 
euclidean inner product •. Then the result just obtained (now with • marking the 
argument-place) 

<er(a);«>=-<«;[e,H> (4.84) 

becomes for a G so(3)* and ^ G so(3) 

6o(3)*(«) ■• = -a- (^A.) . (4.85) 

So for rj G so (3), we have 

< ^so(3)-(a);r? > = ^so(3)- (a) ■ 77 = -a-{^Ari) = -{aA^)-ri = - < aA^^rj > . (4.86) 
In short: 

eso(3)*(«) = -«Ae = eA«. (4.87) 

Now since 5*0(3) is compact, we know that the co-adjoint action is proper; so 
Orb(Q;) is a closed submanifold of so(3)*, and eq. 14.451 of Section lOl applies. So if we 
fix a, and let ^ vary through so (3) = IR^, we get all of the tangent space TQ,Orb(a) to 
the orbit passing through a. Then eq. 14.871 implies that the tangent space is the plane 
normal to a, and passing through a's end-point. Letting a vary through so(3)*, we 
conclude that the co-adjoint orbits are the spheres centred on the origin. 

In the following Sections, we will see that the orbits of the co-adjoint representa- 
tion of any Lie group G have a natural symplectic structure. So the orbits are always 
even-dimensional; and by considering all Lie groups and all possible orbits, we can get 
a series of examples of symplectic manifolds. 
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Besides, this fact will play a central role in our generalized formulation of Hamil- 
tonian mechanics, and in symplectic reduction. And we will (mercifully!) get a good 
understanding of that role, already in Section 15.11 To prepare for that, it is worth 
gathering some threads about our recurrent example, SO {3); and generalizing them to 
other Lie groups ... 

4.6 Kinematics on Lie groups 

To summarize some aspects of this Section, and to make our later discussion of reduc- 
tion clearer, it is worth collecting and generalizing some of our results about 5*0(3) 
and the description it provides of the rigid body. More precisely, we will now combine: 

(i) : the description of space and body coordinates in terms of left and right trans- 
lation, at the end of Section 13.4.41 

(ii) : the cotangent lift of translation (example (viii) of Section UlTl A): 

(iii) : the adjoint and co-adjoint representations of 5*0(3) (as in (4) of Section ll.5.1l 
and (3) of Section 11321 

We will also generalize: namely, we will consider (i) to (iii) for an arbitrary Lie 
group G, not just for 50(3). (The point of doing so will become clear in (3) of Section 
15.11 ) This will occur already in Section [4.6. II Then in Section 14.6.21 we will show how 
this material yields natural diffeomorphisms 

TG^GxQ and T*G ^ G x q* ; (4.88) 

(so if dimO = n, then all four manifolds are 2?T,-dimensional) . We will also see that by 
applying Section H^^ s notion of equivariance, we can "pass to the quotients", and get 
from eq. 14.881 the natural diffeomorphisms 

TG/G^Q and T*G/G ^ q* ; (4.89) 

where the quotients on the left hand sides (the domains) is by the action of left trans- 
lation; (to be precise: by the action of its derivative for TG, and its cotangent lift for 
T*G). 

4.6.1 Space and body coordinates generalized to G 

So let a (finite-dimensional) Lie group G act on itself by left and right translation, Lg 
and Rg. For any g E G,we define 

Xg-.TgG^Qhj vE TgG ^ {T,Lg)-\v) = {TgLg-.){v) G Q . (4.90) 

We similarly define 

Pg-.v&TgG^ {T,R,)-\v) ^ {TgRg-.){v) e . (4.91) 



On analogy with the case of the pivoted rigid body (cf. eq. 13.851 and l3.861 or eq. I4.22p . 

we say that Xg represents v G TgG in body coordinates, and Pg represents v in space 
coordinates. We also speak of body and space representations. The transition from 
body to space coordinates is then an isomorphism of g; viz. by eq. 14.491 

Ve e g, (pg o X~'m = PaiTeLgii)) ^ Adgi . (4.92) 

So we can combine the S and B superscript notation of eq. 14.221 with Section 14.5. U s 
notion of the adjoint representation, and write 

= AdgV^ . (4.93) 

In a similar way, the cotangent lifts of left and right translation provide isomor- 
phisms between the dual spaces T*G,g G G and q*. Thus for any g & G, we define 

A, : T;G ^ b* by « G T;G ^ a o T,Lg = {T^LgYia) = {T:Lg){a) G g* ; (4.94) 

and similarly 

Pg:aeT;G^ao T,Rg = (T;i?,)(a) G Q* . (4.95) 
And we again use the S and B superscript notation of eq. 14.221 and define for a G T*G 

:= (T;i?J(a) = Pg{a) and := (T;Lg)(a) = Xg{a) , (4.96) 

which are called the space (or 'spatial') and body representations, respectively, of a. 
The transition from body to space representations is now an isomorphism of g*; viz. 

VaG0*, {pgoXg'){a) = Adl-^ia) , i.e. = ^^^-^(a^) . (4.97) 
4.6.2 Passage to the quotients 

For later purposes, we need to develop the details of how the element g E G "carries 
along throughout" in eq. I4.9UI to 14.971 More precisely, we have two isomorphisms: 

TG^GxQ and T*G ^ G x q* . (4.98) 

These are isomorphisms of vector bundles; but we shall not develop the language of 
fibre bundles. What matters for us is that once we exhibit these isomorphisms, we will 
see that we have equivariant maps relating two group actions, in the sense of eq. 14.291 
and 14.301 And this will mean that we can pass to the quotients to infer that TG/G is 
diffeomorphic to g, and correspondingly that T*G/G is diffeomorphic to g*. 

This last diffeomorphism will form the first part of Section [7|s main theorem, the 
Lie-Poisson reduction theorem, which says that T*G/G and g* are isomorphic as Pois- 
son manifolds. In Section El onwards, we will develop the notion of a Poisson manifold, 
and the significance of this isomorphism for the reduction of mechanical problems. 

I should note here that there is a parallel story about the first diffeomorphism, i.e. 
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about TG/G being diffeomorphic to q. It forms the first part of anotlier reduction 
tlieorem, wliich is tlie Lagrangian analogue of Section [Tfs Lie-Poisson theorem. But 
since this Chapter has adopted the Hamiltonian approach, I will not go into details. 
They can be found in Marsden and Ratiu (1999: Sections 1.2, 13.5, 13.6), under the 
title 'Euler-Poincare reduction'. 

Thus corresponding to eq. 14.901 we define the isomorphism 

A : TG ^ G X by X{v) := {g, {T,L^r\v)) = {g, {TgLg-i){v)) (4.99) 

with V G TgG, i.e. g = 71g{v) and ttq '■ TG — ^ G the canonical projection. (As 
mentioned concerning eq. 14.61 it is harmless to (follow many presentations and) conflate 
a point in TG, i.e. strictly speaking a pair {g,v),g E G,v E TgG, with its vector v.) 
And corresponding to eq. 14.911 we define the isomorphism 

p:TG^GxQ by p{v):={g,{TMgr\v)) = {g,{TgRg-^){v)). (4.100) 

The transition from body to space representations given by eq. 14.921 now implies 

ipoX-')ig,0=p{9,T,Lg{0) = {g,{T,Rg)-^oT,Lg{0) = {g,AdgO- (4-101) 

In a similar way, the cotangent bundle T*G is isomorphic in two ways to G x g*: 
namely by 

A(«) := (g, a o T,Lg) = {g, (T;L,)«) E G x g* , (4.102) 

and by 

p{a) := (g, a o T^Rg) = {g, {T:Rg)a) E G x q* (4.103) 

where a E T*G, i.e. g = vr^(a) with ttq : T*G G the canonical projection. (Again, 
we harmlessly conflate a point {g, a) in T*G with its form a E T*G.) 

Let us now compute in the body representation, the actions of: (i) the (derivative 
of the) left translation map, TLg, and (ii) the corresponding cotangent lift T*Lg. This 
will show that A and A are equivariant maps for certain group actions. 

(i): We compute: 

(A o TLg o X-'){h, = (A o TLg){h, TL^iO) = KqK {TLg o TL,)(0) (4.104) 

= igh,iiTLg,)-'oTLg,m) = {gh,0. (4.105) 

So in the body representation, left translation does not act on the vector component. 
(That is intuitive, in that the vector ^ is "attached to the body" and so should not 
vary relative to coordinates fixed in it.) Eq. 14.1051 means that A is an equivariant 
map relating left translation TLg on TG to the G-action on G x g given just by left 
translation on the first component: 

^g{{h,0)^g-ih,0:={9h,0- (4.106) 
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Equivariance means that A induces a map A on the quotients. That is: as in eq. 
14.311 the map 

A : TG/G ^ (G X q)/G (4.107) 
defined as mapping, for any the orbit of any v G TgG to the orbit of A(f), i.e. 

A : Orb(f) = {ueTG\ TgLhiy) = u, some h e G} ^ Orb(A(t;)) (4.108) 

= {{hg, {TeLg)-\v)) I some h e G} (4.109) 

is well-defined, i.e. independent of the chosen representative v of the orbit. 

Besides, since the canonical projections, v G TG t— > Orb(f) G TG/G and {g,C,) t— > 
Orb(((?,^)) G {G X q)/G, are submersions, we can apply result (1) of Section and 
conclude that A is smooth. 

Finally, we notice that since the action of left translation is transitive, we can 
identify each orbit of the $ of eq. 14. 1061 with its right component C ^ 0j so we can 
identify the set of orbits (G x q)/G with q. 

To sum up: we have shown that TG/G and (G x q)/G, i.e. in effect q, are diffeo- 
morphic: 

A : TG/G ^ (G X g)/G = q . (4.110) 

(ii): The results for the cotangent bundle are similar to those in (i). On analogy 
with eq. 14.1051 the action of the cotangent lift of left translation T*Lg is given in body 
representation by applying eq. 14.1021 to get 

(A o {T*Lg) o X-^){h, a) = [g-X a) ; (4.111) 

or equivalently, now taking the cotangent lift of left translation to define a left action 
(cf. eq. EH), 

(A o (T* VO ° A-i)(/i, a) = {gh, a) . (4.112) 

So in body representation, left translation does not act on the covector component; 
(again, an intuitive result in so far as a is "attached to the body"). So eq. 14. 11 21 means 
that A is an equivariant map relating the cotangent lifted left action of left translation 
on T*G to the G-action on G x g* given just by left translation on the first component: 

^g{{h,a)) = g-{h,a):={gh,a). (4.113) 

So, on analogy with eq. 14.1071 and 14.1091 we can pass to the quotients, defining a map 

A : T*G/G ^ (G X Q*)/G (4.114) 

by requiring that for a G T*G, so that T*Lh-ia G T^gG: 

A : Orb(a) = {f3 e T*G \ f3 = T*Lh-i{a), some h e G} ^ (4.115) 
Orb(A(a)) = {{hg, {T*Lg){a)) \ some h e G} = {{h, {T*Lg)a) \ some h e G} (4.116) 
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And finally, we identify the set of orbits {G x Q*)/G with q*, so that we conclude 

that T*G/G and g* are diffeomorphic. That is, we think of the diffeomorphism A as 
mapping T*G/G to g*: 

A : Orb(a) = e T*G \ f3 = T*Lh-i{a), some h e G} e T*G/G ^ {T*Lg){a) G Q*. 

(4.117) 

As I said above, this diffeomorphism is the crucial first part of Section [3s main reduc- 
tion theorem. But we will see its role there, already in (3) of Section I^TTl 

Finally, a result which will not be needed later. To calculate the derivatives and 
cotangent lifts of left translation in space representation, we replace A and A by p and 
p as defined by eq. I4.10UI and 14.1031 We get as the analogues of eq. 14.1051 and 14.1111 
respectively: 

(p o TLg o p-')(h, = (gh, AdgiO) , (4.118) 

and 

(p o T*Lg o p-i)(/i, a) = {g-X Ad^a)) . (4.119) 

Though these results are not needed later, they are also analogues of some later results, 
eq. 16.891 and I6.90[ which we will need. (Note that, in accordance with the discussion 
between eq. 14.761 and 14. 77^ eq. 14.1191 involves right actions.) 

5 Poisson manifolds 

5.1 Preamble: three reasons for Poisson manifolds 

Now that we are equipped with Sections 121 and |3Js toolbox of modern geometry, we can 
develop, in this Section and the two to follow, the theory of symplectic reduction. This 
Section develops the general theory of Poisson manifolds, as a framework for a general- 
ized Hamiltonian mechanics. Its main results concern the foliation, and quotienting, of 
Poisson manifolds. Then Section IHl returns us to symmetries and conserved quantities: 
topics which are familiar from Section I2.1.3| but which Section IHl will discuss in the 
generalized framework using the notion of a momentum map. Finally, in Section [3 all 
the pieces of our jigsaw puzzle will come together, in our symplectic reduction theorem. 

We already glimpsed in (1) of Section YI7]\ the idea of a Poisson manifold as a gen- 
eralization of a symplectic manifold, that provides the appropriate framework for a 
generalized Hamiltonian mechanics. It is a manifold equipped with a bracket, called a 
'Poisson bracket', that has essentially the same formal defining properties as in sym- 
plectic geometry except that it can be "degenerate". In particular, the dimension m 
of a Poisson manifold M can be even or odd. As we will see, Hamiltonian mechanics 
can be set up on Poisson manifolds, in a natural generalization of the usual formal- 
ism: there are m first-order ordinary differential equations for the time evolution of 
local coordinates x^, ...,x"^, and the time-derivative of any dynamical variable (scalar 
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function on the Poisson manifold M) is given by its Poisson bracket with the Hamilto- 
nian. Besides, this generahzation reduces to the usual formalism in the following sense. 
Any Poisson manifold M is foliated into symplectic manifolds, and any Hamiltonian 
mechanics of our generalized kind defined on M restricts on each symplectic leaf to a 
conventional Hamiltonian mechanics using the induced symplectic form. 

This last point, the invariance of the symplectic leaves under the dynamics, prompts 
the question 'why bother with the Poisson manifold, since the dynamics can be written 
down on each leaf?'. There are three reasons. I will just mention the first; the rest 
of Section will develop the second; and the two subsequent Sections will develop the 
third. 

(1) : Parameters and stability: — 

The first two reasons concern the fact that for many problems in Hamiltonian mechan- 
ics, it is natural to consider an odd-dimensional state-space. One principal way this 
happens is if the system is characterized by some odd number, say s (maybe s = 1), 
of parameters that are constant in time. Then even though for a fixed value of the 
parameter (s), there is a Hamiltonian mechanics on a symplectic manifold, of dimension 
2n say, it is useful to envisage the 2n + s dimensional space in order to keep track of 
how the behaviour of systems depends on the parameters. For example, this is very 
useful for analysing stability, especially if one can somehow control the value of the pa- 
rameters. Stability theory (and related fields such as bifurcation theory) are crucially 
important, and vast, topics — which I will not go into.^^ 

(2) : Odd- dimensional spaces: the rigid body again: — 

Secondly, even in the absence of such controllable parameters, there are mechanical 
systems whose description leads naturally to an odd- dimensional state-space. The 
paradigm elementary example is the rigid body pivoted at a point (mentioned in (3) of 
Section 123) . An elementary analysis, repeated in every textbook, leads to a descrip- 
tion of the body by the three components of the angular momentum (relative to body 
coordinates, i.e. coordinates fixed in the body): these components evolve according to 
the three first-order Euler equations. 

This situation prompts two foundational questions; (which of course most text- 
books ignore!). First, we note that a configuration of the body is given by three real 
numbers: viz. to specify the rotation required to rotate the body into the given con- 
figuration, from a fiducial configuration. So a conventional Hamiltonian description of 
the rigid body would use six first-order equations. (Indeed, similarly for a Lagrangian 
description, if we treat the three qs as variables.) So how is the description by Euler 's 
equations related to a six- dimensional Hamiltonian (or indeed Lagrangian) description? 

Second, can the description by the Euler equations be somehow regarded as itself 
Hamiltonian, or Lagrangian? 

^^Except to note a broad philosophical point. These parameters illustrate the modal or coun- 
terfactual involvements of mechanics. The s dimensions of the state-space, and the mathematical 
constructions built on them, show how rich and structured these involvement are. For a detailed 
discussion of the modal involvements of mechanics, cf. Butterfield (2004). 
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This Chapter will not pursue these questions about the rigid body; for details, cf. 
the references at the end of (3) of Section 12.21 For us, the important point is that 
the theory of symplectic reduction shows that the answer to the second question is 
Yes. Indeed, a "resounding Yes" . For we will see very soon (in Section I5.2.41 A) that 
the three-dimensional space of the components, in body coordinates, of the angular 
momentum is our prototype example of a Poisson manifold; and the evolution by 
Euler's equations is the Hamiltonian mechanics on each symplectic leaf of this manifold. 
In short: in our generalized framework, Euler's equations are already in Hamiltonian 
form. 

Furthermore, this Poisson manifold is already familiar: it is 5o(3)*, the dual of the 
Lie algebra of the rotation group. Here we connect with several previous discussions 
(and this Chapter's second motto). 

First: we connect with the discussion of rotation in Relationist and Reductionist 
mechanics f Sections 12. 3. 31 to 12. 3. 5|) . In particular, cf. comment (iii) about 7, the three 
variables encoding the total angular momentum of the system, at the end of Section 
12.3.41 (So as regards (l)'s idea of labelling the symplectic leaves by parameters constant 
in time: in this example, it is the magnitude L of the total body angular momentum 
which is the parameter.) 

Second: we connect with Section Il3.4.4f s discussion of so(3), with Section l4.5.2[ s 
discussion of the co-adjoint representation on so(3)*, and with Section H^ s discussion 
of kinematics on an arbitrary Lie group. As regards the rigid body, the main physical 
idea is that the action of S0{3) on itself by left translation is interpreted in terms of 
the coordinate transformation, i.e. rotation, between the space and body coordinate 
systems. 

But setting aside the rigid body: recall that in Section ll.5.2l we saw that for so (3)*, 
the co-adjoint orbits are the spheres centred on the origin. I also announced that they 
have a natural symplectic structure — and that this was true for the orbits of the co- 
adjoint representation of any Lie group. Now that we have the notion of a Poisson 
manifold, we can say a bit more, though of course the proofs are yet to come: — 

For any Lie group G, the dual of its Lie algebra g* is a Poisson manifold; 
and G has on q* a co-adjoint representation, whose orbits are the symplectic 
leaves of g* as a Poisson manifold. 

In particular, we remark that the theory of the rigid body just sketched is inde- 
pendent of the dimension of physical space being three: it carries over to so{n)* for 
any n. So we can readily do the Hamiltonian mechanics of the rigid body in arbitrary 
dimensions. That sounds somewhat academic! But it leads to a more general point, 
which is obviously of vast practical importance. 

In engineering we often need to analyse or design bodies consisting of two or more 
rigid bodies jointed together, e.g. at a universal joint. Often the configuration space 
of such a jointed body can be given by a sequence of rotations (in particular about 
the joints) and-or translations from a fiducial configuration; so that we can take an 
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appropriate Lie group G as the body's configuration space. If so, we can try to mimic 
our strategy for tlie rigid body, i.e. to apply the result just announced. And indeed, 
for such bodies, the action of left translation, and so the adjoint and co-adjoint repre- 
sentations of G on g and g*, can often be physically significant. 

But leaving engineering aside, let us sum up this second reason for Poisson manifolds 
as follows. For some mechanical systems the natural state-space for a Hamiltonian 
mechanics is a Poisson manifold. And in the paradigm case of the rigid body, there is 
a striking interpretation of the Poisson manifold's leaves as the orbits of the co-adjoint 
representation of the rotation group S0{?>). 

(3): Reduction: — 

My first two reasons have not mentioned reduction. But unsurprisingly, they have 
several connections with the notion. Here I shall state just one main connection, which 
links Section I4.6[ s kinematics on Lie groups to our main reduction theorem: this will 
be my third motivation for studying Poisson manifolds. 

In short, the connection is that: — 

(i) : For various systems, the configuration space is naturally taken to be a Lie group 
G] (as we have just illustrated with the rigid body). 

(ii) : So it is natural to set up an orthodox Hamiltonian mechanics of the system 
on the cotangent bundle T*G. But (as in the Reductionist procedure of Section 1^. 3. 4|) 
it is also natural to quotient by the lift to the cotangent bundle of G's action on itself 
by left translation. 

(iii) : When we do this, the resulting reduced phase space T*G/G is a Poisson 
manifold. Indeed it is an isomorphic copy of q*. That is, we have an isomorphism of 
Poisson manifolds: g* = T*G/G. This is the Lie-Poisson reduction theorem. 

I shall give a bit more detail about each of (i)-(iii). 

(i) : For various systems, any configuration can be obtained by acting with an 
element of the Lie group G on some reference configuration which can itself be labelled 
by an element of G, say the identity e G G. So we take the Lie group G to be the 
configuration space. As mentioned in (3) of Section 12.21 there is even an infinite- 
dimensional example of this: the ideal fiuid. 

(ii) : So T*G is the conventional Hamiltonian phase space of the system. But G 
acts on itself by left translation. We can then consider the quotient of T*G by the 
cotangent lift of left translation. Intuitively, this is a matter of "rubbing out" the way 
that T*G encodes (i)'s choice of reference configuration. By passing to the quotients 
as in Section [4.61 we infer that T*G/G is a manifold. But of course it is in general 
not even- dimensional. For its dimension is |dim(T*G) = dim(G). So consider any 
odd- dimensional G: for example, our old friend, the three-dimensional rotation group 
50(3). 

(iii) : But T*G/G is always a Poisson manifold. And it is always isomorphic as a 
Poisson manifold to g*, with its symplectic leaves being the co-adjoint orbits of g*: 
0* = T*G/G. 
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I end this third reason for studying Poisson manifolds with two remarks about 
examples. 

The first remark echoes the end of Section 14.5.21 where I said that by considering 
all possible Lie groups and all the orbits of their co-adjoint representations, we get a 
series of examples of symplectic manifolds. We can now put this together with the 
notion of a Poisson manifold, and with the comment at the end of Section I3.4.3| that 
every (finite-dimensional) Lie algebra is the Lie algebra of a Lie group. In short: we 
get a series of examples of Poisson manifolds, in either of two equivalent ways: from 
the dual q* of any (finite-dimensional) Lie algebra g; or from the quotient T*G/G 
of the cotangent lift of left translation. In either case, the example is the co-adjoint 
representation. 

The second remark is that there are yet other examples of Poisson manifolds and 
reductions. Indeed, we noted one in Section 12.3.41 viz. the Reductionist's reduced 
phase space M := M/E, obtained by quotienting the phase space M := T*IR^^ — (5uA) 
by the (cotangent lift) of the action of the euclidean group E on IR^^. But I shall not 
go into further details about this example; (for which cf. the Belot papers listed in 
Section 1!^. 3. H and references therein). Here it suffices to note that this example is not of 
the above form: IR is not E, and the action of -E on IR is not left translation. This 
of course echoes my remarks at the end of Section 11.21 that the theory of symplectic 
reduction is too large and intricate for this Chapter to be more than an "appetizer" . 

So much by way of motivating Poisson manifolds. The rest of this Section will 
cover reasons (1) and (2); but reason (3), about reduction, is postponed to Sections IHl 
and[7| We give some basics about Poisson manifolds, largely in coordinate-dependent 
language, in Section 15.21 In Section 15. 3( we move to a more coordinate-independent 
language and show that Poisson manifolds are foliated into symplectic manifolds. In 
Section 15.41 we show that the leaves of the foliation of a finite-dimensional Lie algebra 
Q* are the orbits of the co-adjoint representation of G on q*. Finally in Section [5.51 
we prove a general theorem about quotienting a Poisson manifold by the action of Lie 
group, which will be important for Section [7|s main theorem. 

5.2 Basics 

In Sections 15.2.11 to 15.2.31 we develop some basic definitions and results about Pois- 
son manifolds. This leads up to Section 15.2.4[ where we see that the dual of any 
finite-dimensional Lie algebra has a natural (i.e. basis-independent) Poisson manifold 
structure. Throughout, there will be some obvious echoes of previous discussions of 
anti-symmetric forms, Poisson brackets, Hamiltonian vector fields and Lie brackets 
f Sections 12.11 and 13. 2j) . But I will for the most part not articulate these echoes. 



5.2.1 Poisson brackets 



A manifold M is called a Poisson manifold if it is equipped with a Poisson bracket 
(also known as: Poisson structure). A Poisson bracket is an assignment to each pair 
of smooth real- valued functions F, : M — > H, of another such function, denoted by 
{F, subject to the following four conditions: — 

(a) Bilinearity: 

{aF + bG, H} = a{F, H} + b{G, H} ; {F, aG + bH} = a{F, G} + b{F, H} Va, be JR. 

(5.1) 

(b) Anti-symmetry: 

{F,H} = -{H,F}. (5.2) 

(c) Jacobi identity: 

{{F, H}, G} + {{G, F}, H} + {{H, G}, F} = . (5.3) 

(d) Leibniz' rule: 

{F,H-G} = {F,H}-G + H- {F, G} . (5.4) 

In other words: M is a Poisson manifold iff both: (i) the set J-'{M) of smooth scalar 
functions on M, equipped with the bracket {, }, is a Lie algebra; and (ii) the bracket 
{, } is a derivation in each factor. 

Any symplectic manifold is a Poisson manifold. The Poisson bracket is defined by 
the manifold's symplectic form; cf. eq. 12.181 

"Canonical" Example: — 
Let M = IR"*, m = 2n+l, with standard coordinates {q,p, z) = (g^, q"',p^, z^, z'-). 

Define the Poisson bracket of any two functions F{q,p, z), H{q,p, z) by 

Thus this bracket ignores the z coordinates; and if I were equal to zero, it would be 
the standard Poisson bracket for IR^" as a symplectic manifold. We can immediately 
deduce the Poisson brackets for the coordinate functions. Those for the gs and ps are 
as for the usual symplectic case: 

{q\ q^] = y ,pi} = {q\pi] = 6,,. (5.6) 

On the other hand, all those involving the zs vanish: 

{q\z^} = {p\z^} = {z\z^}^Q. (5.7) 

Besides, any function F depending only on the z's, F = F[z) will have vanishing 
Poisson brackets with all functions H : {F, H} = 0. 
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This example seems special in that M is foliated into 2r;,-dimensional symplectic 
manifolds, each labelled by / constant values of the zs. But Section 15.3.41 will give a 
generalization for Poisson manifolds of Darboux's theorem (mentioned at the end of 
Section [2.1 .11) : a generalization saying, roughly speaking, that every Poisson manifold 
"looks locally like this". 

For any Poisson manifold, we say that a function F : M — > IR is distinguished 
or Casimir if its Poisson bracket with all smooth functions if : M — IR vanishes 
identically: {F, H] = 0. 



5.2.2 Hamiltonian vector fields 



Given a smooth function H : M ^ ]R, consider the map on smooth functions: F i— »• 
{F, H}. The fact that the Poisson bracket is bilinear and obeys Leibniz's rule implies 
that this map F t— > {F, H} is a derivation on the space of smooth functions, and so 
determines a vector field on M; (cf. (ii) of Section Hj.l. 21 B). We call this vector field the 
Hamiltonian vector field associated with (also known as: generated by) H, and denote 
it by Xh- 

But independently of the Poisson structure, the action of any vector field Xh on 
a smooth function F, Xh{F), also equals Lx„{F) = dF^Xn); (cf. eq. I3.12|) . So we 
have for all smooth F 

Lx„{F) = dFiXn) = Xh{F) = {F,H} . (5.8) 

The equations describing the flow of Xh are called Hamilton's equations, for the choice 
of H as "Hamiltonian". 

In the previous example with M = IR^""*"', we have 



Qpi Qqi Qqi Qpi 

and the flow is given by the ordinary differential equations 
dq^ dH dp^ OH dz^ 



dt dp^ dt dq^ dt 



0. i = l,...,n; j = l,...,/. (5.10) 



Again, the zs, and any function F{z) solely of them, are distinguished and have a 
vanishing Hamiltonian vector field. On the other hand, the coordinate functions 
and generate the Hamiltonian vector fields — ^ and ^ respectively. 

Two further remarks about eq. 15.81 — 

(1) : It follows that a function H is distinguished (i.e. has vanishing Poisson brackets 
with all functions) iff its Hamiltonian vector field X^ vanishes everywhere. And since 
the Poisson bracket is antisymmetric, this is so iff H is constant along the flow of all 
Hamiltonian vector fields. 

(2) : This equation is the beginning of the theory of constants of the motion (first 



integrals), and of Noether's theorem, for Poisson manifolds; just as the corresponding 
equation was the beginning for the symplectic case. This will be developed in Section 
il 

Poisson brackets and Lie brackets: — 
With the definition eq. 15.81 in hand, we can readily establish our first important con- 
nection between Poisson manifolds and Sectionals Lie structures. Namely: result (2) 
at the end of Section 13.2.21 eq. 13.321 is also valid for Poisson manifolds. 

That is: the Hamiltonian vector field of the Poisson bracket of scalars F, if on a 
Poisson manifold M is, upto a sign, the Lie bracket of the Hamiltonian vector fields, 
Xp and Xh, of F and H: 

X{p,H} = -[Xf,Xh] = [Xh,Xf] . (5.11) 

The proof is exactly as for eq. 13.321 

So the Hamiltonian vector fields, with the Poisson bracket, form a Lie subalgebra 
of the Lie algebra of all vector fields on the Poisson manifold M. This result will 
be important in Section 15. 3. 3f s proof that every Poisson manifold is a disjoint union of 
symplectic manifolds. 



5.2.3 Structure functions 

We show that to compute the Poisson bracket of any two functions given in some local 
coordinates x = x^, ...,x"^, it suffices to know the Poisson brackets of the coordinates. 
For any function if : M — > IR, let the components of its Hamiltonian vector field in the 
coordinate system x be written as h^{x). So X^ = h\x)-^. Then for any other 
function F, we have 

{F,H} = XH{F) = J:Th\x)^^. (5.12) 
Taking as the function F, we get: {x*, H} = X/^(x*) = h^{x). So eq. 15.121 becomes 

{f,h} = j:t{x\h}^- (5.13) 



If we now put x* for H and H for F in eq. I5.13t we get 

dH 

dx^ 



{x\H} = -{H,x'} = -X,.{H) = -E- {x\x'} — .. (5.14) 



Combining eq . s 15 . 1 3l and 15 . llj we get the basic formula for the Poisson bracket of any 
two functions in terms of the Poisson bracket of local coordinates: 

{F.H)^j:7T.f{.;^)--. (5.15) 

We assemble these basic brackets, which we call the structure functions of the Poisson 
manifold, 

r{x) := {x\x^] i,j = l,...,m (5.16) 



into a m X m anti-symmetric matrix of functions, J{x), called the structure matrix of 
M. More precisely, it is the structure matrix for M relative to our coordinate system 
X. Of course, the transformation of J under a coordinate change 
is determined by setting F := x'^,H := x'^ in the basic formula eq. 15.151 

Then, writing VH for the (column) gradient vector of H, eq. 15.151 becomes 

{F,H} = VF ■ JVH. (5.17) 

For example, the canonical bracket on IR^""*"', eq 15.51 written in the {q,p, z) coordinates, 
has the simple form 

/ / 0\ 
J = -/ . (5.18) 
\ / 

where / is the n x n identity matrix. 

We can write the Hamiltonian vector field, and the Hamilton's equations, associated 
with the function H in terms of J. Since 



{x\H} = J:f{x\x^}— (5.19) 



we get: 

x„ = sr (sr . (5.20) 

or in matrix notation: Xh = (JVH) • dx- Similarly, Hamilton's equations 

- = (5.21) 

get the matrix form 

I ^ J(.)VH(.) ; ..e. ^ ^ Er . (5.22) 

To summarize how we have generalized from the usual form of Hamilton's equations: 
compare eq. 15.221 15.171 and 15.181 respectively with eq. I2.12t 12.181 and 12.31 

Note that not every mxm anti-symmetric matrix of functions on an m-dimensional 
manifold (or even: on an open subset of IR™) is the structure matrix of a Poisson 
manifold: for the Jacobi identity constrains the functions. In fact it is readily shown 
that the Jacobi identity corresponds to the following partial differential equations 
governing the J^^{x), which are in general non-linear. Writing as usual di for d/dx^: 

{r^diJ^'' + j'^^diJ'^ + J^^diJ''') = k, = 1, m; Vx G M. (5.23) 

In particular, any constant anti-symmetric matrix J defines a Poisson structure. 
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5.2.4 The Poisson structure on g* 



We can now show that any m-dimensional Lie algebra g defines a Poisson structure, 
often caUed the Lie-Poisson bracket, on any m-dimensional vector space V . We proceed 
in two stages. 

(1) : We first present the definition in a way that seems to depend on a choice of 
bases, both in g (where the definition makes a choice of structure constants) and in 
the space V . 

(2) : Then we will see that choosing V to be g*, the definition is in fact basis- 
independent. 

This Poisson structure on g* will be of central importance from now on. As 
Marsden and Ratiu write: 'Besides the Poisson structure on a symplectic manifold, 
the Lie-Poisson bracket on g*, the dual of a Lie algebra, is perhaps the most funda- 
mental example of a Poisson structure' (1999: 415). Here we return to our motivating 
discussion of Poisson manifolds, especially reasons (2) and (3) of Section 15.11 which 
concerned the rigid body and reduction, respectively. Indeed, we will see already in the 
Example at the end of this Subsection (Section I5.2.4I A) how the Lie-Poisson bracket 
on the special case g* := so(3)* clarifies the theory of the rigid body. And we will 
see in Sections 17.21 and 17.3.31 how for any g, the Lie-Poisson bracket on g* is induced 
by reduction, from the canonical Poisson (viz. symplectic) structure on the cotangent 
bundle T*G. This will be our reduction theorem, that T*G/G = g*. 

After (2), we will see that the Lie-Poisson bracket on g* implies that Hamilton's 
equations on g* can be expressed using ad*: a form that will be needed later. This will 
be (3) below. Then we will turn in Section 15. 2. 4[ A to the example g* := so(3)*. 

(1): A Poisson bracket on any vector space V: — 
Take a basis, say ei, e^, in g, and so structure constants cf^- (cf. eq. I3.24|) . Consider 
the space \^ as a manifold, and coordinatize it by taking a basis, ei, ...,em say, deter- 
mining coordinates x^, We now define the Poisson bracket (in this case, often 
called the Lie-Poisson bracket) between two smooth functions F, if : \^ — > IR to be 



{f.//}:=ES,,, 4.'^-. (5.24) 



This takes the form of eq. 15. 15^ with linear structure functions J^^{x) = c^jx''. One 
easily checks that anti-symmetry, and the Jacobi identity, for the structure constants, 
eq. I3.25[ implies that these J*-^ are anti-symmetric and obey their Jacobi identity eq. 
15.231 So eq. 15.241 defines a Poisson bracket on V. 



In particular, the associated Hamiltonian equations, eq.s 15.211 and I5.22| take the 
form 

% - sr.- 4-'i. (5.25) 

(2): The Lie-Poisson bracket on g*: — 
To give a basis-independent characterization of the Lie-Poisson bracket, we first recall 
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that: 

(i) : the gradient VF{x) of F : V —>■ M. at any point x G K is in the dual space V* 
of (continuous) hnear functionals on V;: 

(ii) : any finite-dimensional vector space is canonically, i.e. basis- independently, 
isomorphic to its double dual: {V*)* = V. 

Then writing < ; > for the natural pairing between V and V*, we have, for any 
yeV 

< VF(.); , >:= lim^o^^^^^^^^^^^^. (5.26) 

T 

Now let us take V in our definition of the Lie-Poisson bracket to be g*. So we will 
show that g makes g* a Poisson manifold, in a basis-independent way. And let the basis 
ei, em be dual to the basis ei, Cm of 0. If F : — > IR is any smooth function, its 
gradient VF{x) at any point x e g* is an element of (g*)* = g. One now checks that 
the Lie-Poisson bracket defined by eq. 15.241 has the basis-independent expression 

{F,H}{x) =<x;[VF{x),VH{x)]> , x e g* (5.27) 

where [, ] is the ordinary Lie bracket on the Lie algebra g itself. 

(3): Hamilton's equations on g*: — 
We can also give a basis-independent expression of the Hamilton's equations eq. 15.251 
viz. by expressing the Lie bracket in eq. 15.271 in terms of ad, as indicated by eq. 14.641 

Thus let F G J^{g*) be an arbitrary smooth scalar function on g*. By the chain 
rule 

dF 

— = DF(x) ■x=<x; VF(x) > . (5.28) 
But applying eq.s 14.641 and I4.8;T1 to eq. 15.271 implies: 

{F,H}{x) =< X- [VF(x), Vi/(x)] >= - < a;;arfvH(x-)(VF(x)) >=< adyH^^){x)]V F{x) > . 

(5.29) 

Since F is arbitrary and the pairing is non-degenerate, we deduce that Hamilton's 
equations take the form 

dx 

— = ad*^H{x){^)- (5-30) 



7. 2. 4. A Example: so(3) and so(3)* As an example of the dual of a Lie algebra as a 
Poisson manifold, let us consider again our standard example so(3)*. We will thereby 
make good our promise in (2) of Section IKTT] to show that Euler's equations for a rigid 
body are already in Hamiltonian form — in our generalized sense. We will also see why 
in the Chapter's second motto, Arnold mentions the three dual spaces, lR^*,so(3)* and 
T*[SO{?>))g] (cf. the discussion at the end of Section l3. 4. 4|1 . 

The Lie algebra so(3) of 5*0(3) has a basis 61,62,63 representing infinitesimal ro- 
tations around the x-, y- and z-axes of IR^. As we have seen, we can think of these 
basis elements: as vectors in IR^ with [, ] as elementary vector multiplication; or as 
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anti-symmetric matrices with [, ] as the matrix commutator; or as left-invariant vector 
fields on 5*0(3) with [, ] as the vector field commutator (i.e. Lie bracket). 

Let ei, 62, €3 be a dual basis for so(3)*, with x = x^ti + x^e2 + x^e^ a typical point 
therein. If F : 5o(3)* IR, its gradient at x is the vector 

OF OF OF 

Then eq. 15.271 tells us that, if we write so(3) as with x for elementary vector 
multiplication, the Lie-Poisson bracket on so(3)* is 

I f dF^dl^ _ dF^dH\ ^ f dF^dl^ _ dF^dH\ 

' \dx^ dx"^ dx'^ dx^ J "' \dx'^ dx^ dx^ dx"^ J 

= -X ■ {VF xVH). (5.33) 

So the structure matrix J{x) is 

J{x) = xs -xi \ , X e 5o(3)*. (5.34) 






-X3 


Xi 







-Xi 


-X2 


Xi 






Hamilton's equations corresponding to the Hamiltonian function H{x) are therefore 

^ = xx VH(x) . (5.35) 
at 

Now consider the Hamiltonian representing the kinetic energy of a free pivoted rigid 
body 

(,36) 

in which the /j are the moments of inertia about the three coordinate axes, and the 
are the corresponding components of the body angular momentum. For this Hamilto- 
nian, Hamilton's equations eq. 15.351 become 

dx^ h-h 2 3 dx'^ h-h 3 1 dx^ h-h I 2 q^n 
— — = — — — X X , — — = — — — X X , — — = — — — X X , 5.37 
dt hh dt /3J1 dt IJ2 

Indeed, these are the Euler equations for a free pivoted rigid body. I shall not go into 
details about the rigid body. I only note that: 

(i) : In the elementary theory of such a body, the magnitude L of the angular 
momentum is conserved, and eq. 15.371 describes the motion of the on a sphere of 
radius L centred at the origin. 

(ii) : In Section we will return to seeing these spheres as the orbits of the co- 
adjoint representation of 50(3) on 5o(3)* (cf. Section |4. 5. 2|) . 

(iii) : Let us sum up this theme by saying, with Marsden and Ratiu (1999, p. 11) 
that here we see: 'a simple and beautiful Hamiltonian structure for the rigid body 
equations'. 
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5.3 The symplectic foliation of Poisson manifolds 

We first reformulate some ideas of Section l^^ in more coordinate- independent language, 
starting with Section I5.2.3t s idea of the structure matrix J(x) f Section l5.3.1|) . Then 
we discuss canonical transformations on a Poisson manifold f Section I5.3.2p . This will 
lead up to showing that any Poisson manifold is foliated by symplectic leaves (Section 
I5.3.3p . Finally, we state a generalization of Darboux's theorem; and again take 5o(3) 
as an example (Section l5.3.4|l . 

5.3.1 The Poisson structure and its rank 

We now pass from the structure matrix J, eq. I5.16( to a coordinate-independent object, 
the Poisson structure (also known as: co-symplectic structure), written B. Whereas J 
multiplied naive gradient vectors, as in eq. 15.171 and 15.221 B is to map the 1-form dH 
into its Hamiltonian vector field; as follows. 

At each point x in a Poisson manifold M, there is a unique linear map B^,, which 
we will also write as B 

B = B^ : T*M T^M (5.38) 

such that 

BMH{x)) = XHix). (5.39) 
For the requirement eq. 15.391 implies, by eq. I5.2U1 that for each j = 1, ...,m 

Since the differentials dx* span T*M, this fixes B^^, by linearity. B^.'s action on any 
one-form a = Hajdx^ is: 

B,.(«) = S„/^(x)a,^U (5.41) 

so that Bx is essentially matrix multiplication by J{x). Here, compare again eq. 15.211 
andEI221 

Here we recall that any linear map between (real finite-dimensional) vector spaces, 
B : V ^ W*, has an associated bilinear form 5* on K x W** = V x W given by 

B\v,w) := < B{v) ; w> . (5.42) 

Accordingly, some authors introduce the Poisson structure as a bilinear form B| : 
T*MxT*M IR, often called the Poisson tensor. Thus eq. EUgives, for a,(3e T*M 

Bl{a,(3) := < B{a),(3> . (5.43) 

B| is antisymmetric, since the matrix J(x) is. So, if we now let x vary over M, we 
can sum up in the traditional terminology of tensor analysis: B" is an antisymmetric 
contravariant two-tensor field. 
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Example: — Consider our first example, M = 1R^"~^' with the "usual bracket" eq. 
15.51 from the start of Section 15.2.11 For any one-form 

a = El^^ittidq' + kdp') + T}-^-^Cjdz^ (5.44) 

we have 

= ih~ - a.|-) . (5.45) 

In this example the form of B is the same from point to point. In particular, the 
kernel of B has everywhere the same dimension, viz. /, the number of distinguished 
coordinates. 

We now define the rank at a; of a Poisson manifold M to be the rank of its Poisson 
structure B at x, i.e. the dimension of the range of B^. This range is also the span of 
all the Hamiltonian vector fields on M at x: 

ran(B^) := {X G T^M : X = B^(a), some a G T*M} = {Xh{x) : H : M ^ IR smooth } . 

(5.46) 

So the rank of M at x is also equal to the dimension of B^^'s domain, i.e. dim(T^M)=dim(M), 
minus the dimension of the kernel, dim(B2;). 

Since in local coordinates, B^^ is given by multiplication by the structure matrix 
J{x), the rank of M at a; is the rank (the same in any coordinates) of the matrix J{x). 
That J{x) is anti-symmetric implies that the rank of M is even: cf. again the normal 
form of antisymmetric bilinear forms, eq. 12.21 and 12.31 

The manifold M being symplectic corresponds, of course, to the rank of B being 
everywhere maximal, i.e. equal to dim(M). 

In this case, the kernel of B is trivial, and any distinguished function H is constant 
on M. For H is distinguished iff X^ = 0; and if the rank is maximal, then dH = 0, so 
that H is constant. 

Besides, each of the Poisson structure and symplectic form on M determine the 
other. In particular, the Poisson tensor B" of eq. I5.43l is. up to a sign, the "contravariant 
cousin" of M's symplectic form uj. For recall: (i) the relation between a symplectic 
manifold's Poisson bracket and its form, eq. 12.181 viz. 

{F, H} = dFiXn) = uj{Xf, Xh) ; (5.47) 

and (ii) eq. 15.81 for Hamiltonian vector fields on a Poisson manifold, viz. 

Xh{F) = {F,H}. (5.48) 

Applying these equations yields, if we start from eq. 15.431 and eq. 15.391 

B\dH, dF) := < B{dH), dF >= dF^Xu) = Xh{F) = {F, H} = uj{Xf, Xh) ■ (5.49) 

We have also seen examples where the Poisson structure B is of non-maximal rank: 
(i): In our opening "canonical" example, the Poisson bracket eq. 15.51 on M = 
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IR has rank 2n everywhere. 

(ii): In the Lie-Poisson structure on 5o(3)*, the rank varies across the manifold: 
it is 2 everywhere, except at the origin x = where it is 0. (Cf. the rank of the matrix 
J in eq. EM) 

5.3.2 Poisson maps 

Already at the beginning of our development of Poisson manifolds, we saw that a scalar 
function H : M ^ M. defines equations of motion, with H as "Hamiltonian" , for all 
other functions F : M — > IR, of the familiar Poisson bracket type: 

F = {F,H}. (5.50) 

(Cf. Section I5.2.2[ especially the remarks around eq. 15.81 ) We now develop the 
generalization for Poisson manifolds of some related notions and results. 

We say that a smooth map / : Mi — ^ M2 between Poisson manifolds (Mi, {, }i) and 
(M2, {, 12) is Poisson or canonical iff it preserves the Poisson bracket. To be precise: 
we first need the idea of the puUback of a function; cf. Section [3. 1.21 A. In this context, 
the pullback /* of a function F : M2 — IR is given by 

f*F:=Fof; i.e. /*F : X e Ml ^ F(/(x)) G IR. (5.51) 

Then we say that / : Mi — ^ M2 is Poisson iff for all smooth functions F, G : M2 —>■ IR 

(F,Ge.F(M2)) 

r{F,G}2 = {rF,rG}i ■ (5.52) 

where by the definition eq. 15.511 the Ihs = {F, G}2 o /, and the rhs = {F o /, G o g}i. 

We note the special case where Mi = M2 =: M and M is symplectic; i.e. the 
Poisson bracket is of maximal rank, and so defines a symplectic form on M, as in eq. 
15.491 In this case, we return to the equivalence in Section ??'s usual formulation of 
Hamiltonian mechanics, between preserving the Poisson bracket and preserving the 
symplectic form. That is: a map / : M — > M on a symplectic manifold M is Poisson 
iff it is symplectic. 

Besides, we already have for symplectic manifolds an infinitesimal version of the 
idea of a Poisson or symplectic map: viz. the idea of a locally Hamiltonian vector 
field; cf. Section r2.1.3[ Similarly for Poisson manifolds, we will need the corresponding 
infinitesimal version of a Poisson map; but not till Section Rj. 1.11 

One can show (using in particular the Jacobi identity) that the flows of a Hamilto- 
nian vector field are Poisson. (Here of course, (Mi, {, }i) = (M2, {, }2).) That is: if (p^ 
is the flow of Xh (i.e. (pr = exp(rX/f)), then 

G} = i.e. {f, g} o 0, = {f o 0,, g o 0,} . (5.53) 

Similarly, one can readily show the equivalent proposition, that along the flow of a 
Hamiltonian vector field the Lie derivative of the Poisson tensor vanishes. That is: 



for any smooth function if : M ^ H, we have: 

£x^B« = 0. (5.54) 

Since preserving the Poisson bracket imphes in particular preserving its rank, it 
follows from eq. 15.531 (or from eq. I5.54|l that: 

If Xh is a Hamiltonian vector field on a Poisson manifold M, then for any r G IR 
and X G M, the rank of M at exp(rX//)(a:) is the same as the rank at x. In other 
words: Hamiltonian vector fields are rank-invariant in the sense used in the general 
form of Frobenius' theorem f Section I3.3.2j) . 

This result will be important for the foliation theorem for Poisson manifolds. 

We will also need the result (also readily shown) that Poisson maps push Hamil- 
tonian flows forward to Hamiltonian flows. More precisely: let / : Mi —>■ M2 be a 
Poisson map; so that at each x G Mi, we have the derivative map on the tangent 
space, Tf : (TMi)^ (TM2)f(x)- And let H : M2 — >■ IR be a smooth function. If 0^ 
is the flow of and ipr is the flow (on Mi) of Xho/, then: 

(t^T^ f = f °i^T and Tf o Xuof = Xh o f . (5.55) 



M2 (5.56) 

(f>T 
M2 

5.3.3 Poisson submanifolds: the foliation theorem 

To state the foliation theorem for Poisson manifolds, we need the idea of a Poisson 
immersion, which leads to the closely related idea of a Poisson submanifold. In effect, 
these ideas combine the idea of a Poisson map with the ideas about injective immersions 
in (2) of Section [3.3.11 We recall from that discussion that for an injective immersion, 
f : N M, the range f{N) is not necessarily a submanifold of M: but f{N) is 
nevertheless called an 'injectively immersed submanifold' of M. (But as mentioned in 
Section Hi. 3. 21 many treatments ignore this point: they in effect assume that an injective 
immersion / is also an embedding, i.e. a homeomorphism between N and f{N), so 
that f{N) is indeed a submanifold of M and / is a diffeomorphism.) 

An injective immersion f : N ^ M, with M a Poisson manifold, is called a Poisson 
immersion if any Hamiltonian vector field defined on an open subset of M containing 
f{N) is in the range of the derivative map of / at y G A^, i.e. ran(Ty/), at all points 
f ly) for yeN. 

Being a Poisson immersion is equivalent to the following rather technical condition. 

Characterization of Poisson immersions An injective immersion / : 
A^ — » M, with M a Poisson manifold, is a Poisson immersion iff: 



In particular, this square commutes: 
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if F,G : V C N ^ H, where V is open in N, and if F, G : t/ IR are 
extensions of F o f^^,G o /^^ : f{V) — ]R to an open neighbourhood U 
of f(y) in M, then {F,G} is well-defined and independent of the 
extensions. 

The main point of this equivalence is that it ensures that if / : — M is a Poisson 
immersion, then has a Poisson structure, and / : ^ M is a Poisson map. It is 
worth seeing how this comes about — by proving the equivalence. 

Proof. Let / : N^M be a Poisson immersion, and let F,G : V G A^— i>lR and let 
F,G:Ud f{V) ^ IR be extensions of F o f-\ G o f-^ : f{V) ^ IR. Then for yeV, 
there is a unique vector v G TA^j^ such that 

Xaifiy)) = {Tyf){v) . (5.57) 

So evaluating the Poisson bracket of F and G at f{y) yields, by eq. 15.81 

{F,G}{f{y)) = dF{f{y))-Xa{f{y)) = dF{f{y))-{Tyf){v) = d{Fof){y).v ^ dF{y)-v . 

(5.58) 

So {F, G}{f{y)) is independent of the extension F of Fof~^. Since the Poisson bracket 
is antisymmetric, it is also independent of the extension G of Go f^^. So we can define 
a Poisson structure on A^ by defining for any y in an open V G N 

{F,G}Niy):={F,G}Mifiy)). (5.59) 

This makes / : N—>-M a Poisson map, since for any F,G on M and any y E N, we 
have that 

[r{F,G}M]{y) ^ [{F,G}Mof]{y) = {F,G}M = {rFJ*G}N{y) ; (5.60) 

where the middle equality uses eq. 15.591 

For the converse implication, assume that eq. 15.581 holds, and let H : U -^IR be 
a Hamiltonian defined on an open subset U of M that intersects f{N). Then as we 
have just seen, A^ is a Poisson manifold and / : N—^M is a Poisson map. Because / is 
Poisson, it pushes Xuof to Xh- That is: eq. 15.551 implies that if y & N is such that 
f{y) G U, then 

Xnifiv)) = {Tyf){XHof{y)) . (5.61) 

So Xnifiy)) is in the range of Tyf; so / : N—^M is a Poisson immersion. QED. 

Now suppose that the inclusion id : N ^ M is a Poisson immersion. Then we call 
A^ a Poisson suhmanifold of M. We emphasise, in line with the warning we recalled 
from (2) of Section l3.3.1l that A^ need not be a submanifold of M; but it is nevertheless 
called an 'injectively immersed submanifold' of M. 

From the definition of a Poisson immersion, it follows that any Hamiltonian vector 
field must be tangent to a Poisson submanifold. In other words: writing X for the 
system of Hamiltonian vector fields on M, and X l^. for their values at s G M, we 



have: if is a Poisson submanifold of M, and x E N, X \xC TN^. 

For the special case where M is a symplectic manifold, we have X \x= T^M, and 
the only Poisson submanifolds of M are its open sets. 

Finally, we define the following equivalence relation on a Poisson manifold M. 
Two points Xi,X2 G M are on the same symplectic leaf if there is a piecewise smooth 
curve in M joining them, each segment of which is an integral curve of a locally 
defined Hamiltonian vector field. An equivalence class of this equivalence relation is a 
symplectic leaf. 

We can now state and prove that Poisson manifolds are foliated. 

7. 3. 3. A Foliation theorem for Poisson manifolds The result is: — 

A Poisson manifold M is the disjoint union of its symplectic leaves. Each 
symplectic leaf is an injectively immersed Poisson submanifold, and the 
induced Poisson structure on the leaf is symplectic. The leaf through the 
point X, Nx say, has dimension equal to the rank of the Poisson structure 
at x; and the tangent space to the leaf at x equals 

TN^ = ran(B^) := {X G T^M : X = B^(a), some a G T*M} (5.62) 

= {Xh{x) : H G J^(f/), U open in M } (5.63) 

Proof. We apply the general form of Frobenius' theorem (Section I3.3.2|) to the 
system X of Hamiltonian vector fields on M. We know from eq. 15.111 (Section I5.2.2|) 
that X is involutive, and from eq. l5.53l above that it is rank-invariant. So by Frobenius' 
theorem, X is integrable. The integral submanifolds are by definition given by the rhs 
of eq. IKIk^ QED. 

One also readily shows that: 

(i) : One can evaluate the Poisson bracket of F, G : M — IR at x G M by restricting 
F and G to the symplectic leaf through x, and evaluating the Poisson bracket that 
is defined by the symplectic form on the leaf Nr^; (i.e. the Poisson bracket defined in 
eq. EM- 

(ii) : A distinguished function is constant on any symplectic leaf of M. 

We end with two remarks. The first is a mathematical warning; the second concerns 
physical interpretation. 

(1): Recall our warning that symplectic leaves need not be submanifolds. This also 
means that all the distinguished functions being constants does not imply that the 
Poisson structure is non-degenerate. Indeed, one can readily construct an example in 
which the symplectic leaves are not manifolds, all distinguished functions are constants, 
and the Poisson structure is degenerate. Namely, one adapts an example mentioned 
before, in Section [3.4.31 the flows on the torus that wind densely around it. (For 
more details about this example, cf. Arnold (1973: 160-167) or Arnold (1989: 72-74) 
or Butterfield (2004a: Section 2.1.3.B); for how to adapt it, cf. Marsden and Ratiu 

Qfi 



(1999: 347). 



(2): As we have seen, any integral curve of any Hamiltonian vector field Xh is 
confined to one of the symplectic leaves. So if we are interested only in the behaviour 
of a single solution through a point x G M, we can restrict our attention to the 
symplectic leaf through x: for the solution will always remain in N^. But as 
stressed in Section 15.11 there are at least three good reasons not to ignore the more 
general Poisson structure! 

5.3.4 Darboux's theorem 

At the end of Section 12.1.11 we mentioned Darboux's theorem: it said that any sym- 
plectic manifold "looks locally like" a cotangent bundle. The generalization for Poisson 
manifolds says that any Poisson manifold "looks locally like" our canonical example 
on ]R"^,m = 2n + I, given at the start of Section 15.2.11 More precisely, we have: 

Let M be an m-dimensional Poisson manifold, and let a; G M be a point 
with an open neighbourhood U G M throughout which the rank is a con- 
stant 2n < m. Then defining / := m — 2n, there is a possibly smaller neigh- 
bourhood U' G U of X, on which there exist local coordinates (g, p, z) = 
(g^, z^, z'), for which the Poisson bracket takes the form 



(So the Poisson brackets for the coordinate functions take the now-familiar 
form given by eq. 15.61 and 15.71 ) The symplectic leaves of M intersect the 
coordinate chart in the slices {z^ = ci, . . . , = q} given by constant values 
of the distinguished coordinates z. 

We shall not give the proof. Suffice it to say that: 

(i) : Like Darboux's theorem for symplectic manifolds: it proceeds by induction on 
the "half-rank" n; and it begins by taking any function F as the "momentum" and 
constructing the canonically conjugate coordinate such that {q^,p^} = 1. 

(ii) : The induction step invokes a version of Frobenius' theorem in which the fact 
that the rank 2n is constant throughout U secures a coordinate system in which the 
2n-dimensional integral manifolds are given by slices defined by constant values of 
the remaining / coordinates. The Poisson structure then secures that these remaining 
coordinates are distinguished. 

7. 3.4. A Example: 3o(3)* yet again We illustrate (1) the foliation theorem and (2) 
Darboux's theorem, with so(3)*; whose Lie-Poisson structure we described in Section 



(1): At a; G so(3)*, the subspace X |^:= {Xh{x) : H G J^{U), U open in M } of 
values of locally Hamiltonian vector fields is spanned by ei := yd^ — zdy representing 




(5.64) 
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infinitesimal rotation about the a;-axis (cf. eq. ??); 62 := zdx — xdz for rotation about 
the y-axis; and 63 := xdy — yd^ for rotation about the 2;- axis. If x 7^ 0, these vectors 
span a two-dimensional subspace of Tso(3)*: viz. the tangent plane to the sphere S\x\ 
of radius | x \ centred at the origin. So the foliation theorem implies that so(3)*'s 
symplectic leaves are these spheres; and the origin. 

We can compute the Poisson bracket of F, G : ^la;!— s>lR by extending F and G 
to a neighbourhood of S\x\'i cf. eq. 15.591 That is: we can consider extensions 
F,G : U D S'l^.i^lR, and calculate the Poisson bracket in so(3)*, whose Poisson struc- 
ture we already computed in eq. 15.331 

Adopting spherical polar coordinates with r =\ x \, i.e. x^ = r cos^^ sin 0, = 
r sin 6 sin 0, = r cos0, we can define F, G merely by F{r, 6, 0) := F{6, (p), G{r, 6, 0) : = 
G{9, 0); so that the partial derivatives with respect to the spherical angles 9, are equal, 
i.e. F0 = Fg, F^ = F^, Gg = Gg, G^ = G^. 

Besides, eq. 15.151 implies that we need only calculate the Poisson bracket in 5o(3)* 
of the spherical angles 9 and 0. So eq. 15.331 gives 

{9, 0} = -X • (V^ X V0) = ; (5.65) 

r sm 

and eq. 15.591 and 15.151 give 

{F, G} = {F, G} = -^{FeG^ - F^Gg) . (5.66) 
r sm 

(2): z := x^ defines the Hamiltonian vector field Xz = x'^d^i —x^d^-^ that generates 
clockwise rotation about the z = x^-axis. So away from the origin the polar angle 
9 := arctan(x^/x^) has a Poisson bracket with z equal to: {9,z} = Xz{9) = — 1. 
Exprssing F,H : 5o(3)*— >-lR in terms of the coordinates z,9 and r : = | x |, we find 
that the Lie-Poisson bracket is: {F,H} = F^Hg — FgH^. So {z,9,r) are canonical 
coordinates. 

5.4 The symplectic structure of the co-adjoint representation 

Section 15.2.41 described how the dual g* of a finite-dimensional Lie algebra of a Lie 
group G has the structure of a Poisson manifold. In this case, the foliation established 
in the previous Subsection has an especially neat interpretation. Namely: the leaves 
are the orbits of the co-adjoint representation of G on g*. 

This symplectic structure in the co-adjoint representation sums up themes from 
Sections 14.51 fespeciallv 117^^ . and 15.2.41 and 1331 In particular, it connects two prop- 
erties of the Lie bracket in q, which we have already seen: viz. 

(i) : The Lie bracket in g gives the infinitesimal generators of the adjoint action; cf. 
eq. KM 

(ii) : The Lie bracket in g defines (in a basis-independent way) a Lie-Poisson bracket 
on g*, thus making g* a Poisson manifold. (Cf. the definition in eq. 15.241 shown to be 
basis-independent by eq. 15.271 ) 
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In fact, there is a wealth of instructive resuhs and examples about the structure of 
the co-adjoint representation: we will only scratch the surface — as in other Sections! 
We will give a proof, under a simplifying assumption, of one main result; and then 
make a few remarks about other results. 

The result is: 

The orbits of the co-adjoint representation are 0*'s leaves 

Let G be a Lie group, with its co-adjoint representation Ad* on g*. That 

is, recalling eq. 14.781 we have: 

Ad* -.G^ End(g*), Adl-i = {%{Rg o Lg-i))* . (5.67) 

The orbits of this representation are the symplectic leaves of g*, taken as 
equipped with its natural Poisson structure, i.e. the Lie-Poisson bracket 
eq. Km 

Proof: — We shall prove this under the simplifying assumption that the co-adjoint action 
of G on g* is proper. (We recall from the definition of proper actions, eq. 14.321 that 
for any compact Lie group, such as 5*0(3), this condition is automatically satisfied.) 
Then we know from result (3) and eq. 14.451 at the end of Section ^31 that this implies 
that the co-adjoint orbit Orb(a) of any a G g* is a closed submanifold of g*, and that 
the tangent space to Orb(a;) at a point (3 G Orb(a;) is 

rOrb(a)^ = : ^eg}. (5.68) 

We will see shortly how this assumption implies that g*'s symplectic leaves are sub- 
manifolds.^^ 

We now argue as follows. For ^ G g, consider the scalar function on g*, : a G 
g* K^{q) := < a; ^ > G 1R; and its Hamiltonian vector field X^^- At each a G g*, 
the gradient VK^{a) = dK^{a), considered as an element of (T*g*)a = g, is just ^ 
itself. Now we will compute XK^{F){a) for any F : g* — > IR and any a G g*, using in 
order: 

(i) : the intrinsic definition of the Lie-Poisson bracket on g*, eq. 15.271 

(ii) : the fact that the infinitesimal generator of the adjoint action is the Lie bracket 
in g, eq. 14.641 

(iii) : the fact that the derivative ad* of the co-adjoint action Ad* is, up to a sign, 
the adjoint of ad^; eq. 14.831 

Thus we get, for all F : g*-^lR and a G g*: 

XK^{F){a) = {F,Ki:}{a) = < a ; [VF{a),VK^{a)] > (5.69) 

= < a; [VF(a),e] > = - < a ; [^,VF{a)] > (5.70) 

= - < a; ad^{VF{a)) > (5.71) 

= < adlia) ; VF(a) > . (5.72) 

^^To verify that our condition is indeed simplifying — i.e. that in general the co-adjoint orbits in 
g* are not submanifolds — consider the example in Marsden and Ratiu (1999: 14.1.(f), p. 449); taken 
from KiriUov (1976: 293). 
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But on the other hand, the vector field Xk^ is uniquely determined by its action on all 
such functions F at all a G g*: 

XK,{F){a) = < XK,{a);VF{a) > . (5.73) 

So we conclude that at each a E g*: 

Xk^ = adl . (5.74) 

But the subspace A* |q, of values at a of Hamiltonian vector fields is spanned by the 
Xx^ia), with ^ varying through q. And as ^ varies through g, ac?|(a) is the tangent 
space TOrb(a)a to the co-adjoint orbit Orb(a) of G through a. So 

X U= TOrb(a)„ . (5.75) 

So the integral submanifolds of the system X of Hamiltonian vector fields, which are 
the symplectic leaves of g* by Section l5.3.3[ A's foliation theorem, are the co-adjoint 
orbits. QED. 

For the illustration of this theorem by our standard example, so(3)*, cf. our previous 
discussions of it: in Section 14.5.21 for its co-adjoint structure; in Section 15. 2. 4l A for its 
Lie-Poisson structure; and in Section 15. 3. 4l A for its symplectic leaf structure. 

We end this Subsection by stating two other results. They are not needed later, 
but they are enticing hints of how rich is the theory of co-adjoint orbits. 

(1) : For each g E G, the co-adjoint map Ad*g : g*^g* is a Poisson map that 
preserves the symplectic leaves of g*. 

(2) : A close cousin of the theorem just proven is that the Lie bracket on g defines 
(via its definition of the Lie-Poisson bracket on g*, eq. I5.27|) a symplectic form, i.e. a 
non-degenerate closed two-form, on each co-adjoint orbit, by: 

uj{a){ad*^{a),ad*{a)) := < a;[^,ri]g > , Va G g*, V^, r/ G g . (5.76) 

This theorem is proven in detail (without our simplifying assumption that G's action 
is proper) by Marsden and Ratiu (1999: Thm 14.3.1, pp. 453-456); and much more 
briefiy by Arnold (1989: 321, 376-377, 457); and rather differently (even without using 
the notion of a Poisson manifold!) in Abraham and Marsden (1978: 302-303). 

5.5 Quotients of Poisson manifolds 

We now end Section El with the simplest general theorem about quotienting a Lie group 
action on a Poisson manifold, so as to get a quotient space (set of orbits) that is itself a 
Poisson manifold. So this theorem combines themes from Sections 0] — in particular, the 
idea from Section HTSl B that for a free and proper group action, the orbits and quotient 
space are manifolds — with material about Poisson manifolds from Section 15.21 (The 
material in Sections 15.31 and 15.41 will not be needed.) This theorem will be important 
in Section [7| We call this result the 
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Poisson reduction theorem: Suppose the Lie group G acts on Poisson 
manifold M is such a way that each $g : M — > M is a Poisson map. 
Suppose also that the quotient space M/G is a manifold and the projection 
TT : M ^ M/G is a smooth submersion (say because G's action on M is free 
and proper, cf. Section I^^ B). Then there is a unique Poisson structure on 
M/G such that tt is a Poisson map. The Poisson bracket on M/G is called 
the reduced Poisson bracket. 

Proof. Let us first assume that M/G is a Poisson manifold and that vr is a Poisson 
map; and show uniqueness. We first note that for any / : M/G — > IR, the function 
/:=/o7r:M— i>lRis obviously the unique G-invariant function on M that projects 
by TT to /. That is: if [x] = Orb(x) = G ■ x is the orbit of x G M, then / assigns the 
same value /([x]) to all elements of the orbit [x]. Besides, in terms of puUbacks (eq. 

EHD, f = 7T*f. 

Then the condition that vr be Poisson, eq. 15.521 is that for any two smooth scalars 
/, h : M/ G — >■ IR, we have an equation of smooth scalars on M: 

{/, h}M/G OTC = {f on,ho n}M = {/, h}M (5.77) 

where the subscripts indicate on which space the Poisson bracket is defined. Since vr 
is surjective, eq. 15.771 determines the value {/, /i}m/g uniquely. 

But eq. 15.771 also defines {f,h}M/G as a Poisson bracket; in two stages. (1): The 
facts that $g is Poisson, and / and h are constant on orbits imply that 

UMig-^) = ({/,Mo<l>,)(x) = {/o$„/io$,}(a;) = {f,h}{x). (5.78) 

That is: {/, h} is also constant on orbits, and so defines {/, h} uniquely. 

(2): We show that {f,h}, as thus defined, is a Poisson structure on M/G, by 
checking that the required properties, such as the Jacobi identity, follow from the 
Poisson structure { , }m on M. QED. 

This theorem is a "prototype" for material to come. We spell this out in two brief 
remarks, which look forward to the following two Sections. 

(1) : Other theorems: — This theorem is one of many that yield new Poisson mani- 
folds and symplectic manifolds from old ones by quotienting. In particular, as we will 
see in detail in Section [71 this theorem is exemplified by the case where M = T*G (so 
here M is symplectic, since it is a cotangent bundle), and G acts on itself by left trans- 
lations, and so acts on T*G by a cotangent lift. In this case, we will have M/G = q*; 
and the reduced Poisson bracket just defined, by eq. 15.771 will be the Lie-Poisson 
bracket we have already met in Section 15.2.41 

(2) : Reduction of dynamics: — Using this theorem, we can already fill out a little 
what is involved in reduced dynamics; which we only glimpsed in our introductory 
discussions, in Section 12.31 and 15.11 We can make two basic points, as follows. 

(A): If if is a G-invariant Hamiltonian function on M, it defines a corresponding 
function h on M/G hj H = Hotc. The fact that Poisson maps push Hamiltonian flows 



forward to Hamiltonian flows (eq. I5.55P implies, since ir is Poisson, that tt transforms 
Xh on M to Xh on M/G. That is: 

TtcoXh = XhOTi ; (5.79) 

i.e. Xfj and Xh are vr-related. Accordingly, we say that the Hamiltonian system X^ 
on M reduces to that on M/ G. 

(B): We shall see in Section ing that G-invariance of H is associated with a family 
of conserved quantities (constants of the motion, first integrals), viz. a constant of the 
motion J(^) : M — IR for each ^ G 0. Here, J being conserved means {J, H} = 0; just 
as in our discussion of Noether's theorem in ordinary Hamiltonian mechanics (Section 
I2.1.3p . Besides, if J is also G-invariant, then the corresponding function j on M/G is 
conserved by Xh since 

{j, h} on = {J,H} = implies {j, h} = . (5.80) 

6 Symmetry and conservation revisited: momen- 
tum maps 

We now develop the topics of symmetry and conserved quantities (and so Noether's 
theorem) in the context of Poisson manifolds. At the centre of these topics lies the idea 
of a momentum map of a Lie group action on a Poisson manifold; which we introduce in 
Section 16.11 This is the modern geometric generalization of a conserved quantity, such 
as linear or angular momentum for the Euclidean group — hence the name. Formally, 
it will be a map J from the Poisson manifold M to the dual g* of the Lie algebra of the 
symmetry group G. Since its values lie in a vector space, it has components. So our 
description of conserved quantities will no longer be "one-dimensional", i.e. focussed 
on a single vector field in the state space, as it was in Sections ?? and ??. The map 
J will be associated with a linear map J from g to JF(M), the scalar functions on 
the manifold M. That is: for each ^ G g, J(^) will be a conserved quantity if the 
Hamiltonian H is invariant under the infinitesimal generator ^m, i-e. if ^m{H) = 0. 

The conservation of momentum maps will be expressed by the Poisson manifold 
version of Noether's theorem f Section 16. 2j) . and illustrated by the familiar examples 
of linear and angular momentum (Section 16. 3p . Then we discuss the equivariance of 
momentum maps, with respect to the co-adjoint representation of G on g*; Section 
16.41 Finally in Section 16.51 we discuss the crucial special case of momentum maps on 
cotangent bundles, again with examples. 

6.1 Canonical actions and momentum maps 

We first apply the definition of Poisson maps (from Section I5.3.2|) to group actions 
f Section I6.1.1|) . This will lead to the idea of the momentum map f Section I6.1.2p . 
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6.1.1 Canonical actions and infinitesimal generators 

Let G be a Lie group acting on a Poisson manifold M by a smooth left action $ : 
G X M ^ M; so that as usual we write $g : x G M i— > ^g{x) := g ■ x & M. As in the 
definition of a Poisson map (eq. I5.52j) . we say the action is canonical if 

^;{f,,F2} = {^;f,,^;f,} (e.i) 

for any Fi,F2 G J-'{M) and any g E G. If M is symplectic with symplectic form u, 
then the action is canonical iff it is symplectic, i.e. $*ci; = uj for all g E G. 

We will be especially interested in the infinitesimal version of this notion; and so 
with infinitesimal generators of actions. We recall from eq. 14.371 that the infinitesimal 
generator of the action corresponding to a Lie algebra element ^ G g is the vector field 

on M obtained by differentiating the action with respect to g at the identity in the 
direction ^: 

Uiix) = — [exp(rO ■ x] \r=o . (6.2) 
So we differentiate eq. 16.11 with respect to g in the direction ^, to give: 

^Mm,F,}) = {^m{F^),F2} + {F,,^m{F,)} . (6.3) 

Such a vector field C,m is called an infinitesimal Poisson automorphism. 

Side-remark: — We will shortly see that it is the universal quantification over g E G 
in eq. 16.11 and correspondingly in eq. 16.31 and 16.51 below, that means our description 
of conserved quantities is no longer focussed on a single vector field; and in particular, 
that a momentum map representing a conserved quantity has components. 

In the symplectic case, differentiating ^*uj = uo implies that the Lie derivative 
C^j^u) of u) with respect to ^ vanishes: C^j^jOJ = 0. We saw in Section 12.1.31 that 
this is equivalent to C,m being locally Hamiltonian, i.e. there being a local scalar 
J : U C M ^ JR such that = Xj. This was how Section 12.1.31 vindicated eq. 
I2.19t s "one-liner" approach to Noether's theorem: because the vector field Xf is locally 
Hamiltonian, it preserves the symplectic structure, i.e. Lie-derives the symplectic form 
CxfijJ = — as a symmetry should. 

We also saw in result (2) at the end of Section 13.2.21 that the "meshing" , up to a 
sign, of the Poisson bracket on scalars with the Lie bracket on vector fields implied that 
the locally Hamiltonian vector fields form a Lie subalgebra of the Lie algebra X{M) of 
all vector fields. 

Turning to the context of Poisson manifolds, we need to note two points. The first 
is a similarity with the symplectic case; the second is a contrast. 

(1) : One readily checks, just by applying eq. 16.31 that the infinitesimal Poisson 
automorphisms are closed under the Lie bracket. So we write the Lie algebra of these 
vector fields as V{M): V{M) C X{M). 

(2) : On the other hand. Section f2.1.3f s equivalence between a vector field being 
locally Hamiltonian and preserving the geometric structure of the state-space breaks 
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down. 

Agreed, the first implies the second: a locally Hamiltonian vector field preserves the 
Poisson bracket. We noted this already in Section l5.3.21 The differential statement was 
that such a field Xh Lie-derives the Poisson tensor: Cxh^^ = (eq. I5.54|) . The finite 
statement was that the fiows of such a field are Poisson maps: (p*{F, G} = {(p*F, (p*G} 
(eq. E3SD. 

But the converse implication fails: an infinitesimal Poisson automorphism on a 
Poisson manifold need not be locally Hamiltonian. For example, make IR^ a Poisson 
manifold by defining the Poisson structure 

IF H} = x(——-——] ■ (6 4) 

\dx dy dx dy J ' 

then the vector field X = d/dy in a neighbourhood of a point on the y-axis is a non- 
Hamiltonian infinitesimal Poisson automorphism. 

This point will affect the formulation of Noether's theorem for Poisson manifolds, 
in Section IHT^l 

Nevertheless, we shall from now on be interested in cases where for all ^, is 
globally Hamiltonian. This means there is a map J : g— >JF(M) such that 

= (6.5) 

for all ^ G 0. There are three points we need to note about this condition. 

(1) : Since the right hand side of eq. 16.51 is linear in ^, we can require such a J to 
be a linear map. For given any J obeying eq. 16. 5| we can take a basis ei, . . . , Cm of g 
and define a new linear J by setting, for any ^ = ^*ej, J(^) := ^V(ej). 

(2) : Eq. 16.51 does not determine J(0- For by the linearity of the map B : dJ{C,) ^— > 
-^J(C)! we can add to such a J(^) any distinguished function, i.e. an F : M — > IR such 
that Xp = 0. That is: Xj(^^)+f = -^JiO- (^^ course, in the symplectic case, the only 
distinguished functions are constants.) 

(3) : It is worth expressing eq. 16.51 in terms of Poisson brackets. Recalling that for 
any F,H E J^{M), we have Xh{F) = {F, H}, this equation becomes 

{F, AO} = UiiF) , VF e ^(M), Ve G . (6.6) 



We will also need the following result: 

^J{[^,v]) = ^{J{i;),J{v)}M ■ (6-7) 

To prove this, we just apply two previous results, each giving a Lie algebra anti- 
homomorphism . 

(i): Result (4) at the end of Section for any left action of Lie group G on any 
manifold M, the map ^ ^ C,m is a Lie algebra anti-homomorphism between and the 
Lie algebra Xm of all vector fields on M: 

(a^ + bri)M = a^M + brjM ; [^m, Vm] = -[^, v]m V^, t] E q, and a, 6 G M. (6.8) 



(ii): The "meshing" up to a sign, just as in the symplectic case, of the Poisson 
bracket on scalars with the Lie bracket on vector fields, as in eq. 15.111 at the end of 
Section EH 

X{F^H} = -[Xf^Xh] = [Xh,Xf] . (6.9) 

So for a Poisson manifold M, the map F G J^{M) ^ Xp G X{M) is a Lie algebra 
ant i-homomorphism . 

Applying (i) and (ii), we deduce eq. 16.71 bv: 

Xj([i,r,\) = [^,v]m = -[^M,r]M] = -[Xj(5),Xj(^)] = X{j(5),j(^)}^^ . (6.10) 

6.1.2 Momentum maps introduced 

So suppose that there is a canonical left action of G on a Poisson manifold M. And 
suppose there is a linear map J : g — J-'{M) such that 

Xji^)=^M (6.11) 

for all ^ G 0. 

The two requirements — that the action be infinitesimally canonical (i.e. each C,m ^ 
V{M)) and that each be globally Hamiltonian — can be expressed as requiring that 
there be a J : g — ^ J^{M) such that there is a commutative diagram. Namely, the map 
C, E g ^ C,M & V{M) is to equal the composed map: 

Q J^{M) V{M) . (6.12) 

Then the map J : M ^ g* defined by 

< J(a;) ; e > := J^x) (6.13) 

for all ^ G and x G M, is called the momentum map of the action. 

Another way to state this definition is as follows. Any smooth function J : M ^ g* 
defines at each ^ G g a scalar J(^) : x G M i-^ (J(x))(^) G IR. By taking J(^) as a 
Hamiltonian function, one defines a Hamiltonian vector field Xj{^^y But since G acts 
on M, each ^ G g defines a vector field on M , viz. ^m- So we say that J is a momentum 
map for the action if for each ^ E these two vector fields are identical: -^j(g) = Cm- 

Three further remarks by way of illustrating this definition: — 
(1): An isomorphism: — One readily checks that eq. 16.131 defines an isomorphism 
between the space of smooth maps J from M to g*, and the space of linear maps J 
from g to scalar functions J^{M). We can take J to define J by saying that at each 
X G M, J (a;) : ^ G g i— J(x)(C) G IR is to be given by the composed map 

g J^(M) ^ IR , (6.14) 



^^i—dxn = ^^,lJ'Kx)—— U ; (6.17) 



where \x means evaluation at a; G M . Or we can take J to define J by saying that at 
each ^ G 0, J(^) : x E M ^ J{^){x) G H is to be given by the composed map 

M^0*^]R, (6.15) 

where |^ means evaluation at ^ G g. 

(2): Differential equations for the momentum map: — Using Hamilton's equations, 
we can readily express the definition of momentum map as a set of differential equations. 
Recall that on a Poisson manifold, Hamilton's equations are determined by eq. 15.391 
which was that at each x G M 

^x{dH{x)) = Xh{x) ; (6.16) 

or in local coordinates x\i = 1, . . . ,m = dim(M), with J*-'(x) = {x\ x^} the structure 
matrix, 

~dx^ ^~ ^^dx^dx' 
(cf. eq. I5.41|) . So in local coordinates, Hamilton's equations are given by eq. 15.221 
which was: 

^ = Ef r'{x)^ . (6.18) 
dt ^ ^ ' dx^ ^ ' 

So the condition for a momentum map = is that for all ^ G g and all x G M 

B,(d(J(0)(x))=a/(a;) • (6.19) 
In coordinates, this is the requirement that for alH = 1, . . . , m 

J'H^f-^ = i^MYix) , (6.20) 

where — apologies! — the two Js on the left hand side have very different meanings. 

In the symplectic case, dim(M) = m = 2n and we have Hamilton's equations as 
eq. I2.15t viz. 

ix„uj:=coiXH,-) = dHi-) . (6.21) 
So the condition for a momentum map is that for all ^ 

uj{^M,-)=d{J{m-) ■ (6.22) 

In Hamiltonian mechanics, it is common to write the 2n local coordinates q,p as ^, i.e. 
to write 

^":=g", a = l,...,n ; C ■= Pa-n, a = n + 1, 2n . (6.23) 

So in order to express eq. 16.221 in local coordinates, let us temporarily write rj for the 
arbitrary element of g. Then writing r]M = (vm)"-^ and u^is := ^), eq. 16.221 

becomes 

cOa^imiT = ^ . (6.24) 



(3): Components: an example: — As discussed after eq. 16.31 we think of the collec- 
tion of functions J{^), as ^ varies through q, as the components of J. 

To take our standard example: the angular momentum of a particle in Euclidean 
space, in a state x = (q, p) is J(x) := qAp. Identifying so (3)* with IR'^ so that the nat- 
ural pairing is given by the dot product (cf. (3) at the end of Section ll.5.2|) . we get that 
the component of J (x) around the axis ^ e IR^ is < J(x) ; ^ >= ^(qAp). The Hamil- 
tonian vector field determined by this Hamiltonian function x = (q, p) ^ ■ (q A p) is 
of course the infinitesimal generator of rotations about the ^-axis. In Section 16 .31 we 
will see more examples of momentum maps. 

6.2 Conservation of momentum maps: Noether's theorem 

In ordinary Hamiltonian mechanics, we saw that Noether's theorem had a simple ex- 
pression as a "one-liner" based on the antisymmetry of the Poisson bracket: namely, 
in eq. 12.191 which was that for any scalar functions F, H 

Xf{H) = {H,F} = iff = {F, H} = Xh{F) . (6.25) 

In words: the Hamiltonian H is constant under the flow induced by F iff F is a con- 
stant of the motion under the dynamical flow X^- 

More precisely. Section r2 . 1 . 31 vindicated this one-liner as expressing Noether's theo- 
rem. For the one-liner respected the requirement that a symmetry should preserve the 
symplectic form (equivalently, the Poisson bracket), and not just (as in the left hand 
side of eq. I6.25|) the Hamiltonian function H; for, by Cartan's magic formula, a vector 
field's preserving the symplectic form was equivalent to its being locally Hamiltonian. 

For Poisson manifolds, the equivalence corresponding to this last statement fails. 
That is, as we noted in (2) of Section 16.1.11 an infinitesimal Poisson automorphism 
need not be locally Hamiltonian. 

Nevertheless, most of the "one-liner" approach to Noether's theorem carries over 
to the framework of Poisson manifolds. In effect, we just restrict discussion to cases 
where the relevant Hamiltonian vector fields exist: recall our saying after (2) of Section 
I6.1.1l that we would concentrate on cases where all the C,m are globally Hamiltonian. 

Thus, it is straightforward to show that for a Poisson manifold M, just as for 
symplectic manifolds: if F, G JF(M), H is constant along the integral curves of Xp 
iff {if, F} = iff F is constant along the integral curves of Xh- (We could have proved 
this already in Section 15. 2. 2| but postponed it till now, when it will be used.) 

With this result as a lemma, one immediately gets 

Noether's theorem for Poisson manifolds Suppose that G acts canon- 
ically on a Poisson manifold M and has a momentum map 3 : M ^ g*; 
and that H is invariant under for all ^ G g, i.e. {H, J{^)} = ^m{H) = 
0, e 0; (cf. eq. 16. 6|) . Then J is a constant of the motion determined by 
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H. That is: 

where (pr is the flow of X^- 



J O 0^ = J 



(6.26) 



Proof. By the lemma, the fact that {H,J{^)} = ^m{H) = implies that J(^) is 
constant along the flow of Xh- So by the definition of momentum map, eq. I6.13[ the 
corresponding g*-valued map J is also a constant of the motion. QED. 

It follows immediately that H itself, and any distinguished function, is a constant 
of the motion. Besides, as remarked in (2) at the end of Section 16.1.11 a constant 
of the motion J(^) is determined only up to an arbitrary choice of a distinguished 
function. Indeed, though this Chapter has set aside (ever since (iii) of Section II. 2|) 
time-dependent functions: if one considers them, then there is here an arbitrary choice 
of a time-dependent distinguished function. 



6.3 Examples 

We give two familiar examples; and then, as a glimpse of the general power of the 
theory, two abstract examples (which will not be needed later on). 

(1): Total linear momentum of N particles : — 
In (3) at the end of Section I4.11 A. we showed that the left cotangent lift of the ac- 
tion of the translation group IR^ on Q = IR^^ to M = T*1R^^, i.e. the left action 
corresponding to eq. 14. IH is 

^,(q„ pO := r*($_,)(q„ = (q^ + x, p% t = l,...,N. (6.27) 

(Here we combine the discussions of examples (vi) and (ix) in Section I^!T1 A.) 

To find the momentum map, we: (a) compute the infinitesimal generator C,m for an 
arbitrary element ^ of g = H^; and then (b) solve eq. 16.221 or in coordinates eq. 16.241 

(a) : We differentiate eq. 16.271 with respect to x in the direction C,, getting 

6f(qi,P') = (e,-,e,0,...,0). (6.28) 

(b) : Any function J{^) has Hamiltonian vector field 



so that the desired J(^) with Xj(^) = solves 



(6.29) 



^ = e and ^ = 0, l<z<Ar. (6.30) 

Choosing constants so that J is linear, the solution is 

^(0(q.pO = (SiIiPO-e, i.e. J(q„pO = S,^iP^ (6.31) 
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i.e. the familiar total linear momentum. 

(2) : Angular momentum of a single particle : — 

5*0(3) acts on Q = JR^ by $A(q) = ^Q- So the tangent (derivative) map is 

: (q, v) e TUl ^ (Aq, Av) G TIR^ ^ . (6.32) 

As we saw in example (vii) of Section I4.11 A. the left cotangent lift of the action to 
M = T*1R^ (the lifted action "with g~^" , corresponding to eq. I4.11|) is: 

TX^{<l>A-^){q,p) = {Aq,Ap) . (6.33) 

To find the momentum map, we proceed in two stages, (a) and (b), as in example (1). 

(a) : We differentiate eq. 16.331 with respect to A in the direction ^ = Q{lj) G 5o(3), 
where a; G IR^ and 9 is as in eq. 13.201 and 13.231 We get 

eM(q, P) = (eq, ep) = (c^ A q, A v) . (6.34) 

(b) : So the desired J{^) is the solution linear in ^ to the Hamilton's equations 

a,>d ^f^^-ep. (6.35) 

So a solution is given by 

^(0(q, P) = (^q) ■ P = (^ A q) ■ p = (q A p) ■ a; , (6.36) 

so that 

J(q,p) = qAp, (6.37) 

i.e. the familiar angular momentum. 

(3) : Dual of a Lie algebra homomorphism : — 

We begin by stating a Lemma, which we will not prove; for details cf. Marsden and 
Ratiu (1999: 10.7.2, p. 372). Namely: let G,H be Lie groups and let a : g ^ P) be 
a linear map between their Lie algebras. Then a is a Lie algebra homomorphism iff 
its dual a* : [)* — *• g* is a (linear) Poisson map (where l)*,g* are equipped with their 
natural Lie- Poisson brackets as in Section 15. 2. 4j) . 

Now let G, H be Lie groups, let A : if — G be a Lie group homomorphism, and 
let a : P) — s> g be the induced Lie algebra homomorphism; so that by the Lemma, 
a* : Q* ^ P)* is a Poisson map. We will prove that a* is also a momentum map for the 
action of H on g* given by, with /i G if, x G g*: 

^{h,x) = h- X := Ad\^^f^y^x . (6.38) 

Proof. We first recall the adjoint and co-adjoint actions Adg : g — * g and Ad*g : 
0* ^ 0*5 ill particular, eq. 14.761 So the action in eq. 16.381 is: 

Va;Gg*,VeGg : < h ■ x;^ > = < x; AdA{h)-^^ > ■ (6.39) 
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As usual, we compute for G f), the infinitesimal generator r/g. at s G 0* by differ- 
entiating eq. 16.391 with respect to h at e in the direction r/ G f). We get (cf. eq. 

EM- 

< Vs'i^)'^^ > = - < x;ada(n)^ > = < adl^^){x);^ > . (6.40) 
We define J(x) := a*{x): that is, 

J{t]){x) = < J{x); ?]>■=< a*{x); t] > = < x; a{T]) > ; (6.41) 

which implies 

V^Jiv) = a{r]) . (6.42) 

Now we recall that Hamilton's equations for J^t]) as the Hamiltonian are (cf. eq. I5.30|) 

X = Xj(^)(x) = arfv^j(^)(x) . (6.43) 

Combining eq. l6.4(Jl to eq. 16.431 we get: 

Xj(^) (x) = adl^^) {x) = r]g* (x) ; (6.44) 

proving that J{x) := a*{x) is a momentum map. QED. 

(4): Momentum maps for subgroups : — 
Assume that J : M — >^ g* is a momentum map for a canonical left action of G on M; 
and let if < G be a subgroup of G. Then H also acts canonically on M, and this 
action has as a momentum map the restriction of J's values to f) C g. That is: the 
map 

Jh-M-^[)* given by 3 h{x) := 3{x) \t, . (6.45) 

For the canonical action of G ensures that if G P) C 5, then rjM = ^J{ri)- Then 
Jh{v) •= J{v)^V ^ ^ defines a momentum map for H's action. That is 

\fx e M, \/r] ei) : < 3h{x); v> = < 3{x); rj > . (6.46) 



6.4 Equivariance of momentum maps 

In (1) of Section we defined the general notion of an equivariant map f : M ^ N 
between manifolds as one that respects the actions of a group G on M and on N: eq. 
14.291 We now develop an especially important case of this notion: the equivariance of 
momentum maps 3 : M ^ q*, where the action on g* is the co-adjoint action, eq. 14.771 

For us, this notion will have two main significances: — 

(i) : many momentum maps that occur in examples are equivariant in this sense; 

(ii) : equivariance has various theoretical consequences: in particular, momentum 
maps for cotangent lifted actions are always equivariant fSection l6.5|) . and equivariance 
is crucial in theorems about reduction (Section [7j). 

In this Section, we will glimpse these points by: 

(i) : defining the notion, and remarking on a weakened differential version of the 
notion f Section 16.4.11) : 

(ii) : proving that equivariant momentum maps are Poisson f Section I6.4.2j) . 



6.4.1 Equivariance and infinitesimal equivariance 

Let $ be a canonical left action of G on M, and let J : M — ^ g* be a momentum map 
for it. We say J is equivariant if for all (7 G G 

J o $3 = Adl-, o J ; (6.47) 

cf. eq. 14.291 and the definition of co-adjoint action, eq. 14.781 

M ^ Q* (6.48) 



•fa 



Ad* 



M Q* 

An equivalent formulation arises by considering that we can add to the commutative 
square in eq. 16.481 the two commutative triangles: 

M^IR is M^g*^lR; (6.49) 
representing the fact that J(^)(x) = J(a;)(,^); and 

0*^1R is fl* ^-t' 0* IR ; (6.50) 

representing the fact that for all rj & g* 

< Adl-r{r]);Adg{^) > = < r]-Adg-iAdg{0 > = <v;^> ■ (6.51) 

Eq.s 16.4^ and implv that an equivalent formulation of equivariance is that for all 
X E M,g E G and ^ G (and with g ■ x = ^g{x)) 

3{g ■ x){AdgO = J{Adg0i9 ■ x) = J(0(x) = J(x)(e) . (6.52) 

In (2) of Section 14.41 we differentiated the general notion of an equivariant map, 
and got the weaker differential notion that the infinitesimal generators and of 
the actions of G on M and on are /-related. 

Here also we can differentiate equivariance, and get the notion of infinitesimal 
equivariance. But 1 will not go into details since: 

(i): we will not need the notion, not least because (as mentioned above), many 
momentum maps are equivariant; 

(i): under certain common conditions (e.g. the group G is compact, or is connected) 
an infinitesimally equivariant momentum map can always be replaced by an equivariant 
one. 

So let it suffice to say that infinitesimal equivariance is theoretically important. In 
particular, the result eq. 16.71 viz. 

^J{K,v]) = ^{J{0,Jiv)}M (6-53) 
1 1 1 



implies that 



m,v)--=A[^,v])-{AO.Av)}M 



(6.54) 



is a distinguished function on the Poisson manifold M, and so constant on every sym- 
plectic leaf. 

This makes it natural to ask when E = 0. After all, cf. eq. 16.121 Both ^ ^ C,m and 
F I— > Xp are Lie algebra anti-homomorphisms. So it is natural to ask whether J is a 
Lie algebra homomorphism, i.e. whether S = 0. And it turns out that infinitesimal 
equi variance is equivalent to E = 0. 

6.4.2 Equivariant momentum maps are Poisson 

The following result is important, both as a general method of finding canonical maps 
between Poisson manifolds, and for the Lie- Poisson reduction theorem of Section [7| 

Equivariant momentum maps are Poisson Let J : M — g* be an 
equivariant momentum map for a canonical left action of G on a Poisson 
manifold M. Then J is a Poisson map: for all Fi, F2 G jF(g*), 



Proof. — We will relate (i) the left hand side, then (ii) the right hand side of eq. 
l6.55l to J; and finally we will use the fact that the Poisson bracket on M depends only 
on the values of the first derivatives. 



(i): Let x E M,a = J{x) G 0*; and let ^ = VFi and r] = VF2 evaluated at a, so 
that ^,776 g** = Q. Then 

{Fi,F2}g,(J(x)) a; [VFi, VF2] >=< a; [^,r]] >= ■m,v])ix) = {J(0,^(^)}W ; 



where the third equation just applies the definition of J, eq. 16.131 and the fourth 
equation uses (infinitesimal) equivariance. 

(ii): We show that (Fio J)(x) and J(^)(x) have equal x-derivatives. For any x G M 
and Vx &TxM 



d(Fi o J)(x) ■ V, = dFi(«) ■ TJ{v,) =< TJ{v,); VFi >= dJ(e)(a;) ■ ; (6.57) 



where the first equation uses the chain rule, and the last uses the definition of J, eq. 
EH and the fact that ^ = VFi. 

Finally, since the Poisson bracket on M depends only on the values of the first 
derivatives, we infer from eq. 16.571 that 



J*{Fi, F2},* = { J*Fi, J*F2}m ; i.e. {F,, F^},, o J 



{Fi o J, F2 o J}a, . 

(6.55) 



(6.56) 



{F,oJ,F2o3}{x) = {J{0,Av)}{x). 



(6.58) 



Combining this with (i), the result follows. QED. 
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6.5 Momentum maps on cotangent bundles 

Let a Lie group G act on a manifold ("configuration space") Q. We saw in Section 
I4.11 A that this action can be lifted to the cotangent bundle T*Q\ cf. eq.s 14.61 14.91 and 
14.111 In this Section, we focus on momentum maps for such cotangent lift actions. We 
shall see that any such action has an equivariant momentum map, for which there is 
an explicit general formula. The general theory (Sections 16.5.11 16.5.2j) will need just 
one main new notion, the momentum function. We end with some examples (Section 

6.5.1 Momentum functions 

Given a manifold Q and its vector fields X{Q), we define the map 

V : X{Q) ^ .F(T*g) by : {V{X)){a,) := < a,- X{q) > (6.59) 

for q E Q,X & X{Q) and G T*Q. Here, ag is, strictly speaking, a point in the 
cotangent bundle above the base-point q E Q: so can be written as (g, a) with a a 
covector at g, i.e. a e T*Q. But as we mentioned just before defining cotangent lifts 
(eq. 14. 6p : it is harmless to (follow many presentations and) conflate a point in T*Q, 
i.e. a pair (g, a), g G Q, a G T*Q^ with its form a, provided we keep track of the q by 
writing the form as a^. 

V{X), as defined by eq. 16.591 is called the momentum function of X. In coordinates, 
V{X) is given by 

V{X)iq\p,) = X^iq% (6.60) 

where we sum on j = l,...,n := dim Q. (So NB: This V is different from that in 
V{M), the infinitesimal Poisson automorphisms of M, discussed in Section [6.1.11 ) 

We also denote by C{T*Q) the space of smooth functions F : T*Q IR that are 
linear on fibres of T*Q: i.e. writing the bundle points a^, /3g G T*Q as (g, a) and (g, /3), 
we have for A, G IR 

F(g, (Aa + ^l(3)) = XF{{q, a)) + fiF{{q, (3)) . (6.61) 
So functions F, H that are in C{T*Q) can be written in coordinates as (summing on 

t = 1 ^ . . . ^ Tlj 

F{q,p) = X\q)pi and H{q,p) =Y'{q)pi (6.62) 

for functions X* and F*; and so any momentum function V{X) is in C{T*Q). 

One readily checks that the standard Poisson bracket (from T*Q's symplectic struc- 
ture. Section [2. 1.1|) of such an F and H is also linear on the fibres oiT*Q. In fact, eq. 
16.621 implies 

rr. ..w X OFdH OH OF fdX\. dY\^\ 



So C{T*Q) is a Lie subalgebra of J^(T*Q). 

The next result summarizes how momentum functions relate i^iQ) and Hamiltonian 
vector fields on T*Q to C{T*Q). 

Three (anti)-isomorphic Lie algebras The two Lie algebras 
(i) {X{Q), [, ]) of vector fields on Q; 

ill) Hamihonian vector fields Xp on T*Q with F e C{T*Q) 
are isomorphic. And each is anti-isomorphic to 
(iii) (£(T*Q),{,}). 

In particular, the map V is an anti-isomorphism from (i) to (iii), so that 
we have 

{V{X),ViY)}T*Q = -Vi[X,Y]) . (6.64) 

Proof: Since V{X) : T*Q ^ IR is linear on fibres, V maps X{Q) into C{T*Q). V is 
also onto C{T*Q): given F G C{T*Q), we can define X{F) G X{Q) by 

<«,;X(F)(g) > := F(«,) Va, G T;g (6.65) 

so that V{X{F)) = F. V is linear and V{X) = implies that X = 0. Also, eq. KM 
follows immediately by comparing eq. 16.631 with the Lie bracket of X, F G X{Q); cf. 
eq. 13.271 So V is an anti-isomorphism from {XQ, [, ]) to {C{T*Q), {, }). 
The map 

F G (£(T*Q), {, }) ^ G ({X^ I F G /:(T*g)}, [, ]) (6.66) 

is surjective by definition. It is a Lie algebra anti-homomorphism, by eq. 13.321 (i.e. 
result (2) in Section f3.2.2j) . And if Xp = 0, then F is constant on T*Q; and hence 
F = since F is linear on the fibres (cf. eq. I6.61|) . QED. 

6.5.2 Momentum maps for cotangent lifted actions 

We begin this Subsection with a result relating the Hamiltonian flow on T*Q induced 
by the momentum function V{X) to the Hamiltonian flow on X induced by X. From 
this result, our main result — the guarantee of an equivariant momentum map for a 
cotangent lifted action, and an explicit formula for it — will follow directly. 

The Hamiltonian flow of a momentum function Let X G X{Q) have 
flow (pr on Q; cf. Section r3.1.2I B. Then the flow of X-p(^x) on T*Q is T*0_^. 
That is: the flow of X-p(x) is the cotangent lift (Section I4.1I A) of as 
given by the diagram, with ttq the canonical projection: 

Q ^ Q (6.67) 

T*Q T*Q 
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Proof. We differentiate the relation in eq. 16.671 i.e. 

7iQ o T*0_^ = (j)^o7iQ (6.68) 

at r = to get 

Tttq o r = X o ttq with Va, e T*Q, Y{a,) = ^ |,=o T*(j)_r{a,) ; (6.69) 

i.e. T*0_^ is the ffow of y. 

Now we will show that Y = X-p(x) , using eq. 16.691 and the geometrical formulation 



of Hamiltonian mechanics of Section 12.11 especially Cartan's magic formula, eq. \2.20\ 
applied to the canonical one-form 6 = Oh (defined by eq. 12.81 and imjl . 

We reported (at the start of (2) of Section 14. II A) that the cotangent lift T* 
preserves 9 = 9h on T*Q. So Cyd = 0. Then the definition of u as the negative 
exterior derivative of 6, and Cartan's magic formula, eq. 12.201 yields 

iycu = -iyde = diY0 . (6.70) 

On the other hand, we also have 

iyOia^) =< 9M, Y{ag) >=< a,; T7iQ{Y{a,)) >=< a,, X{q) >= V{X){a,) (6.71) 



where the second equation applies the definition of the canonical one-form (eq. 
the third applies eq. 16.691 and the fourth applies the definition eq. 16.591 of momentum 
functions. 

Combining eq. 16.701 and 16.711 we have: 

\yuj = dr{X) (6.72) 

which is Hamilton's equations (eq. I2.15|) telling us that Y = X-pi^x)- QED. 

Accordingly the Hamiltonian vector field X-p(^x) on T*Q is called the cotangent lift 
of X G Q to T*Q. In local coordinates, we can write, by combining eq. ?? and 16.601 

'^(^) Qqi Qqi Qp. Qqi Qgj ^ Qp- 

Note in particular that, combining the usual sign-change between Lie algebras and 
Poisson brackets (eq. I3.32|) with the sign-change for momentum functions (eq. I6.64|) . 
we have 

[Xv{x),X'p(j)\ = -X{p(x),p(y)} = -X_p([x,y]) = Xp([x,y]) . (6.74) 

We can now readily prove our main result guaranteeing, and giving a formula for, 
equivariant momentum maps. 
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Equivariant momentum maps Let G act on the left on Q and so by 
cotangent lift on T*Q. The cotangent lifted action has an equivariant mo- 
mentum map J : T*Q —>■ q* given by 



(6.75) 



In coordinates q\pi on T*Q and on g, and with C,q = ■C^^a compo- 
nents of ^Q, this reads 



■a 



(6.76) 



so that Ja{q,p) = PiAHq). 
Proof: The preceding result tells us that for any ^ G g, the infinitesimal generator of 



This gives eq. I6.75t just by applying the definitions of the momentum map J (eq. I6.13|) 
and of momentum function (eq. I6.59|) . 

To prove equi variance, we argue as follows: 



Here we have applied in succession: (i) eq. 16.751 (ii) the fact that g ■ aq is short for 
T* ($3-1 )(«,), cf. eq. \rn\ andWB (iii) the definition of pullback, cf. eq. 071 (iv) 
result [2], eq. 14.521 of Section 14.5.11 (v) eq. 16.751 again: and finally, (vi) the fact that 
Ad* is the adjoint of Ad, cf. eq. liTHl QED. 

6.5.3 Examples 

We discuss first our familiar examples, linear and angular momentum i.e. (1) and (2) 
from Section 16.31 and then the cotangent lift of left and right translations on G — an 
example motivated by Section l^^ s description of kinematics on a Lie group G. 

(1): Total linear momentum of N particles: — 
Since the translation group IR'^ acts on Q := ]R^^ by $(x, (qj)) = (q^ + x), the 
infinitesimal generator on Q is 




< J(^ ■ «<7);^ > = <{9- (^q);^Q{9 ■ q) > 
< aq] {T^g-i)iQ{g ■q)> = <ag; {Tg.g<l>g~i o o (!>g){q) 

= < aq] {Adg-i^)Q{q) > 
= < 3{aq);Adg^i^ > = < Ad*-i(J(ag)); ^ > 



(6.78) 
(6.79) 
(6.80) 
(6.81) 
(6.82) 



= N times) 



(6.83) 



lift 



Applying eq. 16.751 the equivariant momentum map is given by 

Jm^^,p') = {^llP')■^, i-e. J(q.,pO = S^iP^ (6.84) 

agreeing with our previous solution, eq. 16. 3H based on the differential equation eq. 
KM 

(2) : Angular momentum of a single particle: — 

30(3) acts on IR^ by q) = Aq. Writing ^ G so (3) as ^ = Gcu (cf. eq. Km EM 
and 13.77]) . the infinitesimal generator is 

e]R3(q) =eq = ^Aq. (6.85) 

So applying eq. 16.751 the equivariant momentum map J : T*1R2 ^ so(3) ^ IR^ IS given 
by 

< J(q, p); u; >=< p; cjAq >= p- (cjAq) = u;- (qAp) , i.e. J(q, p)=qAp; (6.86) 

agreeing with our previous solution, eq. 16.371 based on the differential equation eq. 
KM 

(3) : The cotangent lift of left and right translations on G: — 
Recalling eq. I4.42t viz. that the infinitesimal generator of left translation is 

^aig) = {T,Rg)i , (6.87) 

a ng'/it-invariant vector field, and applying eq. 16.751 we see that the momentum map 
■.T*G ^ Q* for the cotangent lift of left translation is given by 

< JL(a,);e > = < ag^iaig) > = <ag- {T,Rg)i > = < {T:Rg){ag)-i > (6.88) 

where the last equation applies the definition of the cotangent lift eq. 14.61 That is: the 
equivariant momentum map is 

Ji(«,) = T:Rg{ag) . (6.89) 

In words: the momentum map J^, of the cotangent lift of left translation is the cotangent 
lift of right translation. 

In a similar way, we could consider right translation: Rg : h ^ hg. Right translation 
defines a right action on G, has C,G{g) = {TeLg)^ as its infinitesimal generator, and so 
has 

Jh : T*G ^ B* ; J^(a,) := T;L,(a,) (6.90) 

as the momentum map of its cotangent lift. Note that this momentum map is equiv- 
ariant with respect to Ad*^: which, as discussed after eq. I4.76[ is a right action. 
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7 Reduction 



7.1 Preamble 

In this final Section, the themes of Section |21 onwards come together — at last! As 
announced in Section 15.11 we will concentrate on proving what is nowadays called the 
Lie-Poisson reduction theorem: that is, the isomorphism of Poisson manifolds 

T*G/G ^ Q* . (7.1) 

Here the quotient of T*G is by the cotangent lift of G's action on itself by left trans- 
lation. 

As it happens, this Chapter's main sources (i.e. Abraham and Marsden (1978), 
Arnold (1989), Olver (2000) and Marsden and Ratiu (1999)) do not contain what is 
surely the most direct proof of this result. So we give it in Section 17.21 The result 
will follow directly from four previous main results, one from Section Eland three from 
Section IHl 

'Directly', but for one wrinkle! This relates to "flipping" between left and right 
translation, and their various lifts. In short: the four previous results show that T*G/G 
is isomorphic as a Poisson manifold, not to g* with the Lie-Poisson bracket familiar 
since eq. 15.241 and I5.27| but instead to g* equipped with this bracket's negative, i.e. 
equipped with 

{F,H}4x) := - <x;[VF{x),VH{x)]> , x e Q* . (7.2) 

But we shall (mercifully!) not reproduce, with minus signs appropriately added, our 
entire discussion of the Lie-Poisson bracket that ensued after eq. 15.241 (exercise for the 
reader!). 

To avoid ambiguity, we shall sometimes write for g* equipped with the positive 
Lie-Poisson bracket of eq. 15.27t and gl for g* equipped with the negative Lie-Poisson 
bracket of eq. 17.21 

In fact, it will be clearest from now on, to treat right actions on a par with left 
actions; despite our previous emphasis on the latter. This will mean that we will also 
treat right-invariant vector fields (and another notion of right-invariance defined in 
Section 17.3. 1|) on a par with left-invariant vector fields (and Section 17.3. li s correspond- 
ing new notion of left-invariance). Indeed, we have already glimpsed this would be 
necessary in: 

(i) : Section E31s result that the infinitesimal generator of left translation is a right- 
invariant vector field, and vice versa (eq. 14.421 14.43|) : and its corollaries in Example 
(3) of Section 16.5.3( that 

(ii) : the momentum map Jl of the cotangent lift of left translation is the cotangent 
lift of right translation; (eq. 16.89|) : and 

(iii) : the momentum map Jj^ of the cotangent lift of right translation is the cotan- 
gent lift of left translation; (eq. 16.90|) . 



lis 



So by the end of Section 17^ we will have a short proof of the Lie-Poisson reduction 
theorem. But (as often happens), the most direct proof does not give very much 
information about the situation. So in Section 17^ we give more information (following 
Marsden and Ratiu (1999)). Then in Section 17^ we discuss the reduction of dynamics 
(as against Poisson structure) from T*G to g*. 

Finally, in Section 1731 we state another reduction theorem, which is cast in terms of 
symplectic, not Poisson, manifolds — but which uses several notions from Sectional such 
as free and proper actions, and isotropy groups. But we do not prove this theorem: we 
include it mostly in order to emphasize our previous remark, that (despite its length!) 
this Chapter just scratches the surface of the subject. We also discuss the relation 
between it and the Lie-Poisson reduction theorem. 

7.2 The Lie-Poisson Reduction Theorem 

First we recall from the end of Section l¥. 6. 21 feq. I4.112|) that A : T*G ^ G x g* is an 
equivariant map relating the cotangent lifted left action of left translation on T*G to 
the G-action on G x g* given just by left translation on the first component. So we 

passed to the quotients, and defined A : T*G/G {G x g*)/G by eq. 14.1161 viz. 

A : Orb(a) = {(3 e T*G \ (3 = T*Lh-i{a), some h e G} ^ (7.3) 
Orb(A(a)) = {{hg, {T*Lg){a)) \ some h e G} = {{h, {T*Lg)a) \ some h e G} . (7.4) 

where a G T*G, so that T*Lh-ia G T^gG. Finally, we identified (G x g*)/G with g*, 
so that the diffeomorphism A maps T*G/G to g*, as in eq. 14.1171 

A : Orb(a) = {l3 e T*G \ (3 = T*Lh-i{a), some /i G G} G T*G/G ^ {T*Lg){a) G g*. 

(7.5) 

So now, we are to show that the diffeomorphism A : T*G/G — > g* is a Poisson map, 
in the sense of eq. 15.521 f Section l5. 3. 2|) . So we need to show: 

(i) : T*G/G is a Poisson manifold; 

(ii) : A maps (i)'s Poisson structure on T*G/G to that of g*. In fact, as announced 
in Section 17m A maps on to the Poisson structure of gl, i.e. as given by eq. 17.21 

Prima facie, there could be a judicious choice to be made about (i), i.e. about how 

to define the Poisson structure on T*G/G, so as to secure (ii), i.e. so that A respects 
the Poisson structure. But in fact our previous work gives a pre-eminently obvious 
choice — which works. Namely: we use the Poisson structure induced on T*G/G by 
the Poisson reduction theorem of Section 1331 The result follows directly by combining 
with this theorem, three results from Section [HI 

(i) : that equivariant momentum maps are Poisson; eq. 16.551 in Section [6.4.21 

(ii) : that a cotangent lifted left action has an equivariant momentum map; eq. 16.751 
in Section IB. 5. 21 
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(iii): that the momentum maps of the cotangent hfts of left and right translation 
on G are = T*Rg and 3r = T*Lg; eq. OHl and IFOTni in Section ESSl 

In particular, combining (i)-(iii): one deduces (exercise!) that Jji = T*Lg is equiv- 
ariant with respect to Ad*, and so Poisson with respect to the negative Lie-Poisson 
bracket (eq. I7.2[ s bracket) on g*. That is: it is Poisson with the codomain g*_. 

Thus we have the 

Lie-Poisson reduction theorem The diffeomorphism A : T*G/G g*: 

X : Orb(a) = {l3 e T*G \ (3 = T*Lh-i{a), some h e G} e T*G/G ^ {T*Lg){a) G g* 

(7.6) 

is Poisson. 

Proof. First, eq. 17.61 means we have a commutative triangle. For with tt : T*G 
T*G/G the canonical projection, the momentum map 3r : T*G 0*,ag ^ (T*Lg)ag 

is equal to A o tt: 

T*G ^ T*G/G ^ g* . (7.7) 

Since left translation is a diffeomorphism of G, and the cotangent lift of any diffeomor- 
phism of a manifold to its cotangent bundle is symplectic (cf. after eq. 14.51 in Section 
I4.11 A). the Poisson reduction theorem of Section 1^751 applies. That is, there is a unique 
Poisson structure on T*G/G such that vr is Poisson. We also know from eq. 16.751 16.551 
and l6.9(Jl that J/j = T*Lg is Poisson with respect to eq. I7.2t s bracket on g*. 

We can now deduce that A is Poisson, i.e. that for all x G T*G/G and all 
F,H eTig*_) 

({F, H},,_ o A) (x) = {FoXho X}t*g/g (x) . (7.8) 

We just use (in order) the facts that: 

(i) : 71 is surjective, so that for all x G T*G/G there is an ag G T*G with x = 
TT^ag) = Orb(ag); 

(ii) : = A o tt; 

(iii) : 3r is Poisson; and 

(iv) : TT is Poisson: 

({F, H},, o A) (x) = {F, H},. o (A o tt) (a,) (7.9) 

= {F,H},^oJji{ag) = {FojR,HojR}T*G{ag) (7.10) 

= {FoA,ifoA}r.G/G (^(«,)) = {FoX,HoX}T*G/Gix) . QED. (7.11) 

7.3 Meshing with the symplectic structure on T*G: invariant 
functions 

We turn to giving more information about the situation described by the Lie-Poisson 
reduction theorem. The general idea will be that the Lie-Poisson bracket on g* meshes 
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with the canonical symplectic structure on T*G. This will be made precise in two 
ways: the first is discussed in the first two Subsections, the second is discussed in the 
third Subsection. 

The first discussion will have three stages: 

(i) : we show that scalars on g*, F G ^(fl*), are in one-one correspondence with 
scalars on T*G that are constant on the orbits of the cotangent lift of left translation, 
which will be called left-invariant functions; and similarly, for the cotangent lift of right 
translation (a correspondence with right-invariant functions); 

(ii) : we take the usual canonical Poisson bracket in T*G of these left-invariant or 
right-invariant scalars; and restrict this bracket to g* regarded as the cotangent space 
T*G at the identity e E G; and then 

(iii) : we show that this restriction is the Lie- Poisson bracket on q*: the familiar 
positive one for n^f/it-invariant functions, and the new negative one of eq. 17.21 for the 
/e/t-invariant functions. 

We do stages (i) and (ii) in Section 17.3.11 These stages will not involve the choice 
between the positive and negative Lie- Poisson brackets. But stage (iii), in Section 
17.3.21 will involve this choice. It will be a one-liner corollary of Section I6.4.2f s result 
that equivariant momentum maps are Poisson maps, eq. 16.551 (unsurprisingly, in that 
we also used this result in Section ITT^ s proof of the reduction theorem). 

In the third Subsection, we use invariant functions to show a different sense in which 
the Lie-Poisson bracket on g* meshes with the symplectic structure on T*G. Namely, 
we derive the Lie-Poisson bracket on g* from the Poisson reduction theorem of Section 
15.51 by using the ideas of invariant functions and momentum functions. 

7.3.1 Left-invariant and right- invariant functions on T*G 

We say that a function F : T*G ^ IR is left-invariant if for all g E G, and all ag G T*G 

(FoT*L,)(a,) = F(a,) (7.12) 

where T*Lg is the cotangent lift of Lg : G ^ G. Similarly, F : T*G ^ IR is called 
right-invariant if for all g E G 

{FoT*Rg) = F. (7.13) 

So if F : T*G — > IR is left-invariant or right-invariant, it is determined by its values for 
arguments in T*G = g*. 

Since any a G g* is mapped by T*Lg-i = (T*Lg)'^ to an element of T*G, a func- 
tion is left-invariant iff it is constant on the orbits of the various T*Lg for g E G, 
i.e. constant on the orbits of the cotangent lift of left translation. Similarly, a function 
is right-invariant iff it is constant on the orbits of the cotangent lift of right translation. 

So left-invariant functions induce well-defined functions on the quotient space T*G/ G; 
and so, by Section 17^ on its diffeomorphic (indeed Poisson manifold) copy g*. Simi- 
larly for right-invariant functions. 
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But let us for the moment consider the smooth left-invariant (or right-invariant) 
functions on T*G, rather than the induced maps on the quotient space. We will denote 
the space of all smooth left-invariant functions on T*G by J-'l{T*G), and similarly the 
space of smooth right-invariant functions by J^ii{T*G). 

Recalling (from the discussion after eq. 14. 5|) that cotangent lifts are symplectic 
maps, i.e. T*Lg and T*Rg are symplectic maps on T*G, it follows immmediately that 
J^l{T*G) and J^r{T*G) are each closed under the canonical Poisson bracket on T*G. 
So they are each a Lie algebra with this bracket. 

Now we can use the momentum maps Jl and J/j of Example (3) of Section 16.5.31 
to extend any scalar F : g* ^ IR, i.e. F G jF(g*), to a left-invariant, or right-invariant, 
scalar on T*G. 

Thus, given F : g* ^ IR and G T*G, we define Fl E J^l{T*G) by 

FL{ag) := (F o J^)(a,) ^ (F o r;L,)(a,) . (7.14) 

So Fl is by construction left-invariant, and is called the left-invariant extension of F 
from g* to T*G. 

One similarly defines the right-invariant extension Fji G J-'r{T*G) of any F G ^(g*) 

by 

Fniag) := (F o J^)(a,) ^ (F o T;i?,)(a,) . (7.15) 

Then the maps 

F G ^(g*) ^FlE ^L(r*G) and F G ^(g*) ^ Fn e J^ni^G) (7.16) 

are vector space isomorphisms (exercise for the reader!) whose inverse is just restriction 
to the fiber T*G = g*. 

This completes what we called 'stages (i) and (ii)': describing a correspondence 
between scalars on g* and scalars on T*G that are constant on the orbits of the cotan- 
gent lifts of left and right translation; and considering the canonical Poisson bracket 
(on T*G) of these scalars, i.e. the Lie algebras J-'l(T*G) and J^r(T*G). 

7.3.2 Recovering the Lie-Poisson bracket 

We now do stage (iii): we show that the restriction of the canonical Poisson bracket on 
T*G of the right/left invariant functions, to g* regarded as the cotangent space T*G 
at the identity e G G, is the positive/negative Lie-Poisson bracket. 

Since the inverses of the maps eq. 17. 161 are just restriction to the fiber T*G = g*, it 
suffices to show that the maps eq. 17.161 are Lie algebra isomorphisms. More precisely: 

Recovery of the Lie-Poisson bracket Using the positive Lie-Poisson 
bracket on g* (we write g^): F h-> Fr is a Lie algebra isomorphism. 
Similarly: using the negative Lie-Poisson bracket on g* (we write gl): 
F I— > Fi is a Lie algebra isomorphism. 
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That is: for all F,H eJ^{Q*) 

{F, HU = {Fr, Hnh'G Is* ; {F, H}^ = {Fl, Hl}t'G lr (7-17) 

Proof. Consider : T*G ^ g* = 3l = T*Rg. 3l is an equivariant momentum 
map. So, by the result eq. 16.551 of Section IB. 4. 21 it is Poisson. That is: 

{F, H}+ oJl = {FoJl,Ho Jl}t.g = {Fr, Hr}t*g • (T.18) 

Restricting eq. 17.181 to g* gives the first equation of eq. 17.171 

Similarly, one proves the second equation by using the fact that 3r : T*G ^ g* = 
q^jJr = T*Lg is an equivariant momentum map and so is Poisson. That is: 

{F, H}. o = {F o J^, o 3ji}T,a = {Fl, Hl}t*g ■ (7.19) 
We then restrict eq. rTTTH to g*. QED. 

7.3.3 Deriving the Lie-Poisson bracket 

Our discussion so far, in both Section 17^ and the two previous Subsections, has taken 
the Lie-Poisson bracket (whether positive or negative) as given. We now show, using 
invariant functions and Section [6.5. U s idea of momentum functions, how to derive the 
Lie-Poisson bracket on g*. 

So this derivation will amount to another, more "constructive", proof of the Lie- 
Poisson reduction theorem. As in Section 17^ s proof, two main ingredients will be: 

(a) : the diffeomorphism A between T*G/G and g* (eq. 14. 1 1 71 or 17. 51 or 17^ . and 

(b) : the Poisson reduction theorem of Section 1331 applied to G"s action on T*G. 
But instead of Section 17^ s proof's using the facts that (i) the momentum maps 3r = 
T*Lg and Jl = T*Rg are equivariant and (ii) equivariant momentum maps are Poisson, 
we will now use the ideas of invariant functions and momentum functions. 

We begin by recalling that (since left translation is a diffeomorphism of G, and the 
cotangent lift of any diffeomorphism of a manifold to its cotangent bundle is symplec- 
tic), the Poisson reduction theorem implies that there is a unique Poisson structure 
on T*G/G such that vr : T*G — > T*G/G is Poisson. We now use the diffeomorphism 

A : T*G/G —>■ g* to transfer this Poisson structure to g*. Let us call the result {, }_. 
Though this is not to be read (yet!) as the negative Lie-Poisson bracket, our aim now 
is to calculate that it is in fact this bracket. 

Notice first that since the momentum map Jr : T*G g*, ag (T*Lg)ag is equal 

to A o TT (eq. 17. 7p . we know that is Poisson with respect to this induced bracket on 
g*. That is 

{F,H}^o3j,{ag) = {Fo3j,,Ho3j,}T,G{ag) = {Fl,Hl]t'G {ag) ■ (7.20) 



To calculate the right hand side, we will apply the ideas of invariant functions and 
momentum functions to each argument of the bracket; in particular to the first: 

Fz^K) = F(T;L, ■ ag) . (7.21) 

We observe that since a Poisson bracket depends only on the values of first deriva- 
tives, we can replace F G ^(fl*) by its linearization. That is, we can assume F is linear, 
so that at any point a G g*, F{a) =< a; VF >, where VF is a constant in g = q**. 
Applying this, and the definition of a momentum function eq. 16.591 to eq. \7.21\ we 
get: 

F(T;L, -ag) =< T:L, ■ag;VF> = < a^- T^Lg ■ VF > = ViXs7F)iag) , (7.22) 

where the last equation applies the definition of a momentum function to the left- 
invariant vector field on G, X^{g) = T^Lgi^^), for the case ^ = VF. 

Now we apply to eq. 17.221 in order: eq. 16.641 the definition of the Lie algebra 
bracket (cf. eq. 13.46|) . eq. 16.591 again, and the definition of left- invariant vector fields. 
We get: 

{FL,HL}T'G{ag) = {V{X^F),V{X^H)}T'G{ag) = -V{[X^F,X^H]){ag) (7.23) 

= -P(X[vF,v/f])(«g) = - < ag]X[^FyH]> (7.24) 
= -<ag;T,Lg{[\/F,\/H])>= - < T:Lg{ag)-[V F ,V H] > . (7.25) 

Combining eq. 17.2(Jl and eq. 17.251 and writing a G g* for (T*Lg)ag = Jji{ag), we have 
our result: 

{F, H}4a) = - <a; [VF, \/H] > . (7.26) 

One similarly derives the positive Lie-Poisson bracket by considering right-invariant 
extensions of linear functions. The minus sign coming from eq. 16. 641 is cancelled by the 
sign reversal in the Lie bracket of right-invariant vector fields. That is, it is cancelled 
by a minus sign coming from eq. 13.581 

7.4 Reduction of dynamics 

We end our account of the Lie-Poisson reduction theorem by discussing the reduction 
of dynamics from T*G to g*. 

We can be brief since we have already stated the main idea, when discussing the 
Poisson reduction theorem; cf. (2) (A) in Section 15.51 Thus recall that (under the 
conditions of the theorem) a G-invariant Hamiltonian function on a Poisson manifold 
M, if : M — >■ H, defines a corresponding function h on M/ G hj H = h o tt^ where vr 
is the projection vr : M — M/G; and since vr is Poisson, and so pushes Hamiltonian 
flows forward to Hamiltonian flows, vr pushes Xh on M to Xh on M/ G: 

TtioXh = Xhon . (7.27) 

Applying this, in particular eq. 17.271 to the Lie-Poisson reduction theorem, we get 
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Reduction of dynamics Let H : T*G — > ]R be left-invariant. Tliat is: 
the function H~ := if |g. on q* satisfies 

H{ag)=H-{3n{ag))=H-{T:Lg-ag) , ageT^G. (7.28) 

Then Jji pushes forward to X^-. Or in terms of the flows (pit) and 
4>~{t) of Xf{ and Xh~ respectively: 

JK(0(t)K)) = 0-(t)(JijK)) . (7.29) 

Similar statements hold for a right-invariant function H : T*G M., its 
restriction := if |g. and Jl = T*Rg. 

Besides, we already know the vector fleld of H~ on g*. For eq. l5.3Ul in (3) of Section 
15.2.41 gave a basis- independent expression of Hamilton's equations on q* in terms of ad*. 
We just need to note that since we are now using the negative Lie-Poisson bracket on 
Q*, all terms in the deduction (eq. I5.29|) apart from the left hand side, get a minus 
sign. So writing a G 0*, eq. 15.301 for the vector fleld Xh- becomes: 

— = - ad*^H-ia){(^) ■ (7-30) 

On the other hand, we can go in the other direction, reconstructing the dynamics 
on T*G from eq. 17.301 on q*. The statement of the main result, below, is intuitive, in 
that the "reconstruction equation" for g(t) E T*G is 

g-'g = VH- . (7.31) 

This is intuitive since it returns us to the basic idea of mechanics on g and g*, viz. that 
the map 

Xg-.geTgG^ Xg{g) ■= iTgLg^.)g e g (7.32) 

maps the generalized velocity to its body representation; cf. eq. 14.901 However, the 
proof of this result is involved (Marsden and Ratiu (1999: theorems 13.4.3, 13.4.4, p. 
423-426); so we only state the result. It is: — 

Reconstruction of dynamics Suppose given a Lie group G, a left-invariant 
H : T*G —>■ IR, its restriction := H |g., and an integral curve a{t) of the 
Lie-Poisson Hamilton's equations eq. 17.301 on g*, with the initial condition 
a{0) = T*Lg^^{agQ). Then the integral curve in T*G of Xh is given by 

T;(,)L,(,)-.(a(t)) ; (7.33) 

where g{t) is the solution of the reconstruction equation 

g-^ = Vii- (7.34) 

with initial condition (^(0) = g^. 



7.5 Envoi: the Marsden-Weinstein-Meyer theorem 



I emphasize that our discussion of reduction has only scratched the surface: after all 
this Section has been relatively short! But now that the reader is armed with the long 
and leisurely exposition from Section |21 onwards, they are well placed to pursue the 
topic of reduction; e.g. through this Chapter's main sources, Abraham and Marsden 
(1978), Arnold (1989), Olver (2000) and Marsden and Ratiu (1999). 

In particular, the reader can now relate the Lie-Poisson reduction theorem to an- 
other main theorem about symplectic reduction, usually called the Marsden-Weinstein- 
Meyer or Marsden- Weinstein theorem (after these authors' papers in 1973 and 1974). 

This theorem concerns a symplectic action of a Lie group G on a symplectic man- 
ifold {M,uj). For the sake of completeness, and to orient the reader to Landsman's 
discussion of this theorem (this vol., ch. 5, especially Section 4.5), it is worth stating it 
(as usual, for the finite-dimensional case only), together with the lemma used to prove 
it, and the ensuing reduction of dynamics. These statements will also round off our 
discussion by illustrating how some notions expounded from Section |21 onwards, but 
not used in this Section, are nevertheless useful — e.g. in stating the hypotheses of this 
theorem. 

So suppose the Lie group G acts symplectically (eq. 16. 1|) on the symplectic manifold 
{M,uj); and that J : M — > g* is an 74(i*-equivariant momentum map for this action 
(eq. 16.471 and I6.52|) . Assume also that a G g* is a regular value of J, i.e. that at every 
point X G J~^(q;), T^J is surjective. So the submersion theorem of (1) of Section 13.3.11 
applies; in particular, J~^(a) is a sub-manifold of M with dimension dim(M) - dim(0*) 
= dim(M) - dim(G). 

Let Ga be the isotropy group (eq. I4.33|) of a under the co-adjoint action, i.e. 



So since J is y4(i*-equivariant under Ga, the quotient space := 3~^{a)/Ga is well- 
defined. 

Now assume that G^ acts freely and properly on J~^(a), so that f Section I4.3I B) 
the quotient space Mq, = 3~^{a)/Ga is a manifold. is the reduced phase space 
(corresponding to the momentum value a). 

Now we assert: 

Marsden-Weinstein-Meyer theorem Ma has a natural symplectic form 
uja induced from (M, u) as follows. Let u, v be two vectors tangent to Ma 
at some point p G M^: so p is an orbit of Go's action on J~^(a), and 
M,f G TpMa- Then u and v are obtained, respectively, from some vectors 
u' and v' tangent to J^^(a) at some point x G J~^(a) of the orbit p, by the 
projection tTq : J^^(a) — ^ Ma- That is: 



Ga ■= {g E G \ Ad*-ia = a}. 



(7.35) 



Tna{u') = u ; Tna{v') = v . 



(7.36) 



It turns out that the value assigned by M's symplectic form uj is the same 
whatever choice of x, m', v' is made. So we define the symplectic form lo^ on 
Ma as assigning this value. In other words: writing tTq for the projection, 
ioL '■ J~^(a) M for the inclusion, and * for puUback: 

vt^lUq = • (7-37) 



The proof of this theorem uses the following Lemma. Let us write G ■ x for the 
orbit Orb(a;) of x under the action of all of G, and similarly Ga ■ x for the orbit under 
Ga, i.e. {^{g,x) \ g G Ga}- Then the Lemma states: 

For any x G 3~^{a): — 

(i) : T^iGa ■ x) = T^iG ■ x) n T^{J-\a)); and 

(ii) : Tx{G-x) and Tx{i~^ {a)) are cu-orthogonal complements of one another 
in TM. That is: for all u' E T^M: 

u' G T,(J-i(a)) iff uj{u', v') = for all v' G T^{G ■ x). 

Both the Lemma and the theorem are each proven in some dozen lines. For details, 
cf. Abraham and Marsden (1978: Theorems 4.3.1-2, p. 299-300), or Arnold (1989: 
Appendix 5.B, p. 374-376). 

Two final remarks. (1): The reduction of dynamics secured by the Marsden- 
Weinstein-Meyer theorem is similar to what we have seen before, for both the Poisson 
reduction theorem ((2) of Section l^75|) . and the Lie- Poisson reduction theorem (Section 
I7.4jl . One proves, again in a few lines (Abraham and Marsden (1978: Theorems 4.3.5, 
p. 304): 

Marsden- Weinstein-Meyer reduction of dynamics Let if : M — > IR 

be invariant under the action of G on M, so that by Noether's theorem 
for momentum maps f Section 16. 2j) J is conserved, i.e. J~^(a) is invariant 
under the flow 0(t) of Xh on M. Then 0(t) commutes with the action of 
Ga on J~^(a) (i.e. (pit) o = $g o (j)(t) for g G Ga), and so defines a flow 
0(t) on Ma such that tTq, o (j)(t) = 0(t) o tTq,, i.e. 

3'\a) Ma (7.38) 

</>(*) kt) 

3-\a) Ma 

The flow (f){t) is Hamiltonian with the Hamiltonian Ha defined by HaOT^a = 

H O ia. 

(2): I said at the start of this Subsection that the reader can now relate the Lie- 
Poisson reduction theorem to the Marsden- Weinstein-Meyer theorem. It is not hard 
to show that the former is an example of the latter. As the symplectic manifold M 
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one takes T*G, acted on symplectically by the cotangent lift of left translation. So we 
know (from (3) of Section [6.5.3|) that Jl := T*Rg is an 74(i*-equivariant momentum 
map ... and so on: I leave this as an exercise for the reader! The answer is supplied 
at Arnold (1989: 377, 321) and Abraham and Marsden (1978: 302). (Abraham and 
Marsden call it the 'Kirillov-Kostant-Souriau theorem'.) 

Suffice it to say here that this exercise gives another illustration of one of our central 
themes, that g*'s symplectic leaves are the orbits of the co-adjoint representation. For 
the reduced phase space is naturally identifiable with the co-adjoint orbit Orb(a) 
of a e g*, with the symplectic forms also naturally identified; (cf. also result (2) at the 
end of Section 13?^ . 
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